HOMOLOGOUS 28-KILODALTON IMMUNODOMINANT PROTEIN
GENES OF EHRLICHIA CANIS AND USES THEREOF 



BACKGROUND OF THE INVENTION 

Field of the Invention

The present invention relates generally to the field of 
molecular biology. More specifically, the present invention relates to 
molecular cloning and characterization of homologous 28-kDa 
protein genes in Ehrlichia canis and a multigene locus encoding the 
28-kDa homologous proteins of Ehrlichia canis and uses thereof. 

Description of the Related Art

Canine ehrlichiosis, also known as canine tropical 
pancytopenia, is a tick-borne rickettsial disease of dogs first described 
in Africa in 1935 and the United States in 1963 (Donatien and 
Lestoquard, 1935; Ewing, 1963). The disease became better 
recognized after an epizootic outbreak occurred in United States
military dogs during the Vietnam War (Walker et al., 1970).

The etiologic agent of canine ehrlichiosis is Ehrlichia canis, 
a small, gram-negative, obligate intracellular bacterium which exhibits 
tropism for mononuclear phagocytes (Nyindo et al., 1971) and is
transmitted by the brown dog tick, Rhipicephalus sanguineus (Groves 
et al., 1975). The progression of canine ehrlichiosis occurs in three
phases: acute, subclinical and chronic. The acute phase is
characterized by fever, anorexia, depression, lymphadenopathy and 
mild thrombocytopenia (Troy and Forrester, 1990). Dogs typically
recover from the acute phase, but become persistently infected
carriers of the organism without clinical signs of disease for months 
or even years (Harrus et al., 1998). A chronic phase develops in
some cases that is characterized by thrombocytopenia,
hyperglobulinemia, anorexia, emaciation, and hemorrhage,
particularly epistaxis, followed by death (Troy and Forrester, 1990). 

Molecular taxonomic analysis based on the 16S rRNA gene 
has determined that E. canis and E. chaffeensis, the etiologic agent of
human monocytic ehrlichiosis (HME), are closely related (Anderson et
al., 1991; Anderson et al., 1992; Dawson et al., 1991; Chen et al.,
1994). Considerable cross reactivity of the 64, 47, 40, 30, 29 and
23-kDa antigens between E. canis and E. chaffeensis has been reported
(Chen et al., 1994; Chen et al., 1997; Rikihisa et al., 1994; Rikihisa et
al., 1992). Analysis of immunoreactive antigens with human and
canine convalescent phase sera by immunoblot has resulted in the
identification of numerous immunodominant proteins of E. canis,
including a 30-kDa protein (Chen et al., 1997). In addition, a 30-kDa
protein of E. canis has been described as a major immunodominant
antigen recognized early in the immune response that is antigenically
distinct from the 30-kDa protein of E. chaffeensis (Rikihisa et al.,
1992; Rikihisa et al., 1994). Other immunodominant proteins of E.
canis with molecular masses ranging from 20 to 30-kDa have also
been identified (Brouqui et al., 1992; Nyindo et al., 1991; Chen et al.,
1994; Chen et al., 1997).

Recently, cloning and sequencing of a multigene family 
(omp-1) encoding proteins of 23 to 28-kDa have been described for E.
chaffeensis (Ohashi et al., 1998). The 28-kDa immunodominant outer
membrane protein gene (p28) of E. chaffeensis, homologous to the
Cowdria ruminantium map-1 gene, was cloned. Mice immunized with
recombinant P28 were protected against challenge infection with the
homologous strain according to PCR analysis of peripheral blood 5 days
after challenge (Ohashi et al., 1998). Molecular cloning of two
similar, but nonidentical, tandemly arranged 28-kDa genes of E. canis
homologous to the E. chaffeensis omp-1 gene family and C. ruminantium
map-1 gene has also been reported (Reddy et al., 1998).

The prior art is deficient in the lack of cloning and 
characterization of new homologous 28-kDa immunoreactive protein
genes of Ehrlichia canis and a single multigene locus containing the
homologous 28-kDa protein genes. Further, the prior art is deficient
in the lack of recombinant proteins of such immunoreactive genes of
Ehrlichia canis. The present invention fulfills this long-standing need 
and desire in the art. 



SUMMARY OF THE INVENTION 

The present invention describes the molecular cloning, 
sequencing, characterization, and expression of homologous mature 
28-kDa immunoreactive protein genes of Ehrlichia canis (designated
ECa28-1, ECa28SA3 and ECa28SA2), and the identification of a single
locus (5.592-kb) containing five 28-kDa protein genes of Ehrlichia
canis (ECa28SA1, ECa28SA2, ECa28SA3, ECa28-1 and ECa28-2).
Comparison with E. chaffeensis and among E. canis 28-kDa protein
genes revealed that ECa28-1 shares the most amino acid homology
with the E. chaffeensis omp-1 multigene family and is highly conserved
among E. canis isolates. The five 28-kDa proteins were predicted to
have signal peptides resulting in mature proteins, and had amino acid
homology ranging from 51 to 72%. Analysis of intergenic regions
revealed hypothetical promoter regions for each gene, suggesting that
these genes may be independently and differentially expressed.
Intergenic noncoding regions ranged in size from 299 to 355-bp, and
were 48 to 71% homologous.

In one embodiment of the present invention, there are 
provided DNA sequences encoding a 30-kDa immunoreactive protein 
of Ehrlichia canis. Preferably, the protein has an amino acid sequence 
selected from the group consisting of SEQ ID No. 2, SEQ ID No. 4 and
SEQ ID No. 6, and the gene has a nucleic acid sequence selected from
the group consisting of SEQ ID No. 1, SEQ ID No. 3 and SEQ ID No. 5
and is a member of a polymorphic multiple gene family. Generally,
the protein has an N-terminal signal sequence which is cleaved after
post-translational processing, resulting in the production of a mature
28-kDa protein. Still preferably, the DNAs encoding 28-kDa proteins are
contained in a single multigene locus, which has the size of 5.592 kb 
and encodes all five homologous 28-kDa proteins of Ehrlichia canis. 

In another embodiment of the present invention, there is 
provided an expression vector comprising a gene encoding a 28-kDa
immunoreactive protein of Ehrlichia canis and capable of expressing 
the gene when the vector is introduced into a cell. 

In still another embodiment of the present invention, there 
is provided a recombinant protein comprising an amino acid sequence 
selected from the group consisting of SEQ ID No. 2, SEQ ID No. 4 and
SEQ ID No. 6. Preferably, the amino acid sequence is encoded by a
nucleic acid sequence selected from the group consisting of SEQ ID
No. 1, SEQ ID No. 3 and SEQ ID No. 5. Preferably, the recombinant
protein comprises four variable regions which are surface exposed,
hydrophilic and antigenic. The recombinant protein may be useful as
an antigen. 

In yet another embodiment of the present invention, there 
is provided a method of producing the recombinant protein, 
comprising the steps of obtaining a vector that comprises an 
expression region comprising a sequence encoding the amino acid
sequence selected from the group consisting of SEQ ID No. 2, SEQ ID
No. 4 and SEQ ID No. 6 operatively linked to a promoter; transfecting
the vector into a cell; and culturing the cell under conditions effective 
for expression of the expression region.

The invention may also be described in certain
embodiments as a method of inhibiting Ehrlichia canis infection in a
subject comprising the steps of: identifying a subject suspected of
being exposed to or infected with Ehrlichia canis; and administering a
composition comprising a 28-kDa antigen of Ehrlichia canis in an
amount effective to inhibit an Ehrlichia canis infection. The inhibition
may occur through any means, such as the stimulation of the
subject's humoral or cellular immune responses, or by other means
such as inhibiting the normal function of the 28-kDa antigen, or even
competing with the antigen for interaction with some agent in the
subject's body.

Other and further aspects, features, and advantages of the 
present invention will be apparent from the following description of 
the presently preferred embodiments of the invention given for the 
purpose of disclosure. 

BRIEF DESCRIPTION OF THE DRAWINGS 

So that the manner in which the above-recited features,
advantages and objects of the invention, as well as others which will
become clear, are attained and can be understood in detail, more
particular descriptions of the invention briefly summarized above may
be had by reference to certain embodiments thereof which are
illustrated in the appended drawings. These drawings form a part of
the specification. It is to be noted, however, that the appended
drawings illustrate preferred embodiments of the invention and
therefore are not to be considered limiting in their scope.

Figure 1 shows nucleic acid sequence (SEQ ID No. 1) and
deduced amino acid sequence (SEQ ID No. 2) of ECa28-1 gene
including adjacent 5' and 3' non-coding sequences. The ATG start
codon and TAA termination codon are shown in bold, and the 23 amino
acid leader signal sequence is underlined.

Figure 2 shows SDS-PAGE of expressed 50-kDa
recombinant ECa28-1-thioredoxin fusion protein (Lane 1, arrow) and
16-kDa thioredoxin control (Lane 2, arrow), and corresponding
immunoblot of recombinant ECa28-1-thioredoxin fusion protein
recognized by convalescent-phase E. canis canine antiserum (Lane 3).
Thioredoxin control was not detected by E. canis antiserum (not
shown).

Figure 3 shows alignment of ECa28-1 protein (SEQ ID NO.
2), and ECa28SA2 (partial sequence, SEQ ID NO. 7) and ECa28SA1 (SEQ
ID NO. 8), E. chaffeensis P28 (SEQ ID NO. 9), E. chaffeensis OMP-1
family (SEQ ID NOs: 10-14) and C. ruminantium MAP-1 (SEQ ID NO.
15) amino acid sequences. The ECa28-1 amino acid sequence is
presented as the consensus sequence. Amino acids not shown are
identical to ECa28-1 and are represented by a dot. Divergent amino
acids are shown with the corresponding one letter abbreviation. Gaps
introduced for maximal alignment of the amino acid sequences are
denoted with a dash. Variable regions are underlined and denoted
(VR1, VR2, VR3, and VR4). The arrows indicate the predicted signal
peptidase cleavage site for the signal peptide.

Figure 4 shows phylogenetic relatedness of E. canis ECa28-1
with the ECa28SA2 (partial sequence) and ECa28SA1, 6 members of
the E. chaffeensis omp-1 multiple gene family, and C. ruminantium
map-1 from deduced amino acid sequences utilizing unbalanced tree
construction. The length of each pair of branches represents the
distance between the amino acid sequence of the pairs. The scale
measures the distance between sequences.

Figure 5 shows Southern blot analysis of E. canis genomic
DNA completely digested with six individual restriction enzymes and
hybridized with an ECa28-1 DIG-labeled probe (Lanes 2-7); DIG-labeled
molecular weight markers (Lanes 1 and 8).

Figure 6 shows comparison of predicted protein
characteristics of ECa28-1 (Jake strain) and E. chaffeensis P28
(Arkansas strain). Surface probability predicts the surface residues
by using a window of hexapeptide. A surface residue is any residue
with a >2.0 nm² of water accessible surface area. A hexapeptide with
a value higher than 1 was considered as surface region. The antigenic
index predicts potential antigenic determinants. The regions with a
value above zero are potential antigenic determinants. T-cell motif
locates the potential T-cell antigenic determinants by using a motif of
5 amino acids with residue 1-glycine or polar, residue 2-hydrophobic,
residue 3-hydrophobic, residue 4-hydrophobic or proline, and residue
5-polar or glycine. The scale indicates amino acid positions.

Figure 7 shows nucleic acid sequences and deduced
amino acid sequences of the E. canis 28-kDa protein genes ECa28SA2
(nucleotide 1-849: SEQ ID No. 3; amino acid sequence: SEQ ID No. 4)
and ECa28SA3 (nucleotide 1195-2031: SEQ ID No. 5; amino acid
sequence: SEQ ID No. 6) including intergenic noncoding sequences
(NC2, nucleotide 850-1194: SEQ ID No. 31). The ATG start codon and
termination codons are shown in bold.
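By way of illustration only, the five-residue T-cell motif recited for Figure 6 (residue 1 glycine or polar, residues 2 and 3 hydrophobic, residue 4 hydrophobic or proline, residue 5 polar or glycine) may be sketched as a sliding-window scan. The residue classes below are assumptions made for this sketch; the specification does not list which amino acids are treated as hydrophobic or polar, and the function name is hypothetical.

```python
# Assumed residue classes (one-letter codes); not taken from the specification.
HYDROPHOBIC = set("AVLIMFWC")
POLAR = set("STNQYDEKRH")


def tcell_motif_positions(protein):
    """Return 1-based start positions of 5-residue windows matching the motif."""
    protein = protein.upper()
    hits = []
    for i in range(len(protein) - 4):
        r1, r2, r3, r4, r5 = protein[i:i + 5]
        if ((r1 == "G" or r1 in POLAR)
                and r2 in HYDROPHOBIC
                and r3 in HYDROPHOBIC
                and (r4 in HYDROPHOBIC or r4 == "P")
                and (r5 == "G" or r5 in POLAR)):
            hits.append(i + 1)
    return hits
```

The scan reports every window that satisfies the pattern; how such hits were weighted or displayed in the figure is not stated and is not modeled here.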

Figure 8 shows schematic of the five E. canis 28-kDa
protein gene locus (5.592-kb) indicating genomic orientation and
intergenic noncoding regions (28NC1-4). The 28-kDa protein genes
shown in Locus 1 and 2 (shaded) have been described (McBride et al.,
1999; Reddy et al., 1998; Ohashi et al., 1998). The complete sequence
of ECa28SA2 and a new 28-kDa protein gene (designated ECa28SA3,
unshaded) were sequenced. The noncoding intergenic regions (28NC2-3)
between ECa28SA2, ECa28SA3 and ECa28-1 were completed, joining the
previously unlinked loci 1 and 2.

Figure 9 shows phylogenetic relatedness of the five E.
canis 28-kDa protein gene members based on amino acid sequences
utilizing unbalanced tree construction. The length of each pair of
branches represents the distance between amino acid pairs. The scale
measures the distance between sequences.

Figure 10 shows alignment of E. canis 28-kDa protein
gene intergenic noncoding nucleic acid sequences (SEQ ID Nos.
30-33). Nucleic acids not shown, denoted with a dot (.), are identical to
noncoding region 1 (28NC1). Divergence is shown with the
corresponding one letter abbreviation. Gaps introduced for maximal
alignment of the nucleic acid sequences are denoted with a dash (-).
Putative transcriptional promoter regions (-10 and -35) and ribosomal
binding site (RBS) are boxed.

DETAILED DESCRIPTION OF THE INVENTION 

WO 00/32745 PCT/US99/28075

The present invention describes cloning, sequencing and
expression of homologous genes encoding a 30-kilodalton (kDa)
protein of Ehrlichia canis. A comparative molecular analysis of
homologous genes among seven E. canis isolates and the E. chaffeensis
omp-1 multigene family was also performed. Two new 28-kDa protein
genes are identified, ECa28-1 and ECa28SA3. ECa28-1 has an 834-bp
open reading frame encoding a protein of 278 amino acids (SEQ ID
No. 2) with a predicted molecular mass of 30.5-kDa. An N-terminal
signal sequence was identified suggesting that the protein is
post-translationally modified to a mature protein of 27.7-kDa. ECa28SA3
has an 840-bp open reading frame encoding a 280 amino acid protein
(SEQ ID No. 6).
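The relationship between the stated open reading frame lengths and protein lengths can be checked with simple arithmetic: each codon is three nucleotides, so an 834-bp open reading frame corresponds to 278 residues. The sketch below also gives a rough mass estimate using an average residue mass of about 110 Da, which is a common rule of thumb assumed here and not the method used to obtain the 30.5-kDa and 27.7-kDa values above. The 23-residue leader length is taken from the description of Figure 1.

```python
AVG_RESIDUE_MASS_DA = 110.0  # assumed rule-of-thumb average, not from the specification


def residues_from_orf(orf_bp):
    """Number of codons (amino acids) in an open reading frame of orf_bp nucleotides."""
    if orf_bp <= 0 or orf_bp % 3 != 0:
        raise ValueError("open reading frame length must be a positive multiple of 3")
    return orf_bp // 3


def approx_mass_kda(n_residues):
    """Rough molecular mass in kDa from residue count (approximation only)."""
    return n_residues * AVG_RESIDUE_MASS_DA / 1000.0


ecA28_1_residues = residues_from_orf(834)   # 278 residues
mature_residues = ecA28_1_residues - 23     # after removal of the 23-residue leader
```

With these assumptions the precursor and mature estimates fall within about 0.5 kDa of the predicted 30.5-kDa and 27.7-kDa values; exact masses depend on the actual amino acid composition.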

Using PCR to amplify 28-kDa protein genes of E. canis, a
previously unsequenced region of ECa28SA2 was completed. Sequence
analysis of ECa28SA2 revealed an 849-bp open reading frame encoding
a 283 amino acid protein (SEQ ID No. 4). PCR amplification using
primers specific for 28-kDa protein gene intergenic noncoding regions
linked two previously separate loci, identifying a single locus
(5.592-kb) containing all five 28-kDa protein genes. The five 28-kDa
proteins were predicted to have signal peptides resulting in mature
proteins, and had amino acid homology ranging from 51 to 72%.
Analysis of intergenic regions revealed hypothetical promoter regions
for each gene, suggesting that these genes may be independently and
differentially expressed. Intergenic noncoding regions (28NC1-4)
ranged in size from 299 to 355-bp, and were 48 to 71% homologous.

The present invention is directed to two new homologous 
28-kDa protein genes in Ehrlichia canis, ECa28-1 and ECa28SA3, and a
complete sequence of previously partially sequenced ECa28SA2. Also 
disclosed is a multigene locus encoding all five homologous 28-kDa 
outer membrane proteins of Ehrlichia canis. 

In one embodiment of the present invention, there are 
provided DNA sequences encoding a 30-kDa immunoreactive protein 
of Ehrlichia canis. Preferably, the protein has an amino acid sequence 
selected from the group consisting of SEQ ID No. 2, SEQ ID No. 4 and 
SEQ ID No. 6, and the gene has a nucleic acid sequence selected from 
the group consisting of SEQ ID No. 1, SEQ ID No. 3 and SEQ ID No. 5
and is a member of a polymorphic multiple gene family. More
preferably, the protein has an N-terminal signal sequence which is
cleaved after post-translational processing, resulting in the production of
a mature 28-kDa protein. Still preferably, the DNAs encoding 28-kDa
proteins are contained in a single multigene locus, which has the size
of 5.592 kb and encodes all five homologous 28-kDa proteins of
Ehrlichia canis. 

In another embodiment of the present invention, there is 
provided an expression vector comprising a gene encoding a 28-kDa 
immunoreactive protein of Ehrlichia canis and capable of expressing 
the gene when the vector is introduced into a cell. 

In still another embodiment of the present invention, there 
is provided a recombinant protein comprising an amino acid sequence 
selected from the group consisting of SEQ ID No. 2, SEQ ID No. 4 and
SEQ ID No. 6. Preferably, the amino acid sequence is encoded by a
nucleic acid sequence selected from the group consisting of SEQ ID
No. 1, SEQ ID No. 3 and SEQ ID No. 5. Preferably, the recombinant
protein comprises four variable regions which are surface exposed, 
hydrophilic and antigenic. Still preferably, the recombinant protein is 
an antigen. 

In yet another embodiment of the present invention, there 
is provided a method of producing the recombinant protein, 
comprising the steps of obtaining a vector that comprises an 
expression region comprising a sequence encoding the amino acid 
sequence selected from the group consisting of SEQ ID No. 2, SEQ ID 
No. 4 and SEQ ID No. 6 operatively linked to a promoter; transfecting 
the vector into a cell; and culturing the cell under conditions effective 
for expression of the expression region. 

The invention may also be described in certain 
embodiments as a method of inhibiting Ehrlichia canis infection in a 
subject comprising the steps of: identifying a subject suspected of 
being exposed to or infected with Ehrlichia canis; and administering a
composition comprising a 28-kDa antigen of Ehrlichia canis in an
amount effective to inhibit an Ehrlichia canis infection. The inhibition
may occur through any means, such as the stimulation of the
subject's humoral or cellular immune responses, or by other means
such as inhibiting the normal function of the 28-kDa antigen, or even
competing with the antigen for interaction with some agent in the
subject's body. 

In accordance with the present invention there may be 
employed conventional molecular biology, microbiology, and
recombinant DNA techniques within the skill of the art. Such
techniques are explained fully in the literature. See, e.g., Maniatis,
Fritsch & Sambrook, "Molecular Cloning: A Laboratory Manual" (1982);
"DNA Cloning: A Practical Approach," Volumes I and II (D.N. Glover ed.
1985); "Oligonucleotide Synthesis" (M.J. Gait ed. 1984); "Nucleic Acid
Hybridization" [B.D. Hames & S.J. Higgins eds. (1985)]; "Transcription
and Translation" [B.D. Hames & S.J. Higgins eds. (1984)]; "Animal Cell
Culture" [R.I. Freshney, ed. (1986)]; "Immobilized Cells And Enzymes" 
[IRL Press, (1986)]; B. Perbal, "A Practical Guide To Molecular Cloning" 
(1984). 

Therefore, if appearing herein, the following terms shall 
have the definitions set out below.

A "replicon" is any genetic element (e.g., plasmid, 
chromosome, virus) that functions as an autonomous unit of DNA 
replication in vivo; i.e., capable of replication under its own control.

A "vector" is a replicon, such as plasmid, phage or cosmid, 
to which another DNA segment may be attached so as to bring about
the replication of the attached segment. 

A "DNA molecule" refers to the polymeric form of 
deoxyribonucleotides (adenine, guanine, thymine, or cytosine) in
either its single-stranded form or a double-stranded helix. This term
refers only to the primary and secondary structure of the molecule,
and does not limit it to any particular tertiary forms. Thus, this term
includes double-stranded DNA found, inter alia, in linear DNA
molecules (e.g., restriction fragments), viruses, plasmids, and
chromosomes. The structure is discussed herein according to the
normal convention of giving only the sequence in the 5' to 3' direction
along the nontranscribed strand of DNA (i.e., the strand having a
sequence homologous to the mRNA).

A DNA "coding sequence" is a double-stranded DNA 
sequence which is transcribed and translated into a polypeptide in
vivo when placed under the control of appropriate regulatory
sequences. The boundaries of the coding sequence are determined by
a start codon at the 5' (amino) terminus and a translation stop codon
at the 3' (carboxyl) terminus. A coding sequence can include, but is
not limited to, prokaryotic sequences, cDNA from eukaryotic mRNA,
genomic DNA sequences from eukaryotic (e.g., mammalian) DNA, and
even synthetic DNA sequences. A polyadenylation signal and
transcription termination sequence will usually be located 3' to the
coding sequence.
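As a minimal sketch of how the boundaries of a coding sequence follow from this definition, the function below locates the first ATG start codon on the given strand and reads in frame until a translation stop codon. The function name and the choice to report the first open reading frame found are assumptions for illustration; real gene prediction also weighs length, frame and regulatory context.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard translation stop codons


def find_coding_sequence(dna):
    """Return (start, end) as 0-based half-open indices of the first
    ATG...stop open reading frame on the given strand, or None."""
    dna = dna.upper()
    start = dna.find("ATG")
    while start != -1:
        # Step through codons in the frame set by this ATG.
        for pos in range(start + 3, len(dna) - 2, 3):
            if dna[pos:pos + 3] in STOP_CODONS:
                return start, pos + 3  # stop codon included in the span
        start = dna.find("ATG", start + 1)
    return None
```

For example, in the hypothetical sequence `CCATGAAATTTTAAGG` the span runs from the ATG at index 2 through the TAA ending at index 14.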

Transcriptional and translational control sequences are
DNA regulatory sequences, such as promoters, enhancers,
polyadenylation signals, terminators, and the like, that provide for the 
expression of a coding sequence in a host cell. 

A "promoter sequence" is a DNA regulatory region capable 
of binding RNA polymerase in a cell and initiating transcription of a 
downstream (3' direction) coding sequence. For purposes of defining 
the present invention, the promoter sequence is bounded at its 3'
terminus by the transcription initiation site and extends upstream (5'
direction) to include the minimum number of bases or elements
necessary to initiate transcription at levels detectable above
background. Within the promoter sequence will be found a
transcription initiation site, as well as protein binding domains
(consensus sequences) responsible for the binding of RNA
polymerase. Eukaryotic promoters often, but not always, contain
"TATA" boxes and "CAT" boxes. Prokaryotic promoters contain
Shine-Dalgarno sequences in addition to the -10 and -35 consensus
sequences. 
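By way of example, a search for candidate -35 and -10 regions may be sketched as below. The hexamers TTGACA and TATAAT are the well-known Escherichia coli sigma-70 consensus sequences and are used here as an assumption; the specification does not state which consensus sequences or spacing were applied to the E. canis intergenic regions, and the spacer range and mismatch tolerance are likewise illustrative parameters.

```python
MINUS_35 = "TTGACA"  # E. coli sigma-70 -35 consensus (assumed for this sketch)
MINUS_10 = "TATAAT"  # E. coli sigma-70 -10 consensus (assumed for this sketch)


def mismatches(a, b):
    """Count positions at which two equal-length strings differ."""
    return sum(x != y for x, y in zip(a, b))


def find_promoter_candidates(dna, max_mismatches=1, spacer=(15, 19)):
    """Return (pos_35, pos_10) 0-based index pairs where a -35-like hexamer
    is followed, after an allowed spacer, by a -10-like hexamer."""
    dna = dna.upper()
    hits = []
    for i in range(len(dna) - 5):
        if mismatches(dna[i:i + 6], MINUS_35) > max_mismatches:
            continue
        for gap in range(spacer[0], spacer[1] + 1):
            j = i + 6 + gap
            box = dna[j:j + 6]
            if len(box) == 6 and mismatches(box, MINUS_10) <= max_mismatches:
                hits.append((i, j))
    return hits
```

Such a scan only proposes hypothetical promoter regions; confirmation requires experimental evidence of transcription initiation.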

An "expression control sequence" is a DNA sequence that 
controls and regulates the transcription and translation of another 
DNA sequence. A coding sequence is "under the control" of 
transcriptional and translational control sequences in a cell when RNA
polymerase transcribes the coding sequence into mRNA, which is then 
translated into the protein encoded by the coding sequence. 

A "signal sequence" can be included near the coding 
sequence. This sequence encodes a signal peptide, N-terminal to the 
polypeptide, that communicates to the host cell to direct the
polypeptide to the cell surface or secrete the polypeptide into the
media, and this signal peptide is clipped off by the host cell before the
protein leaves the cell. Signal sequences can be found associated with
a variety of proteins native to prokaryotes and eukaryotes.

The term "oligonucleotide", as used herein in referring to the
probe of the present invention, is defined as a molecule comprised of
two or more ribonucleotides, preferably more than three. Its exact 
size will depend upon many factors which, in turn, depend upon the 
ultimate function and use of the oligonucleotide. 

The term "primer" as used herein refers to an 
oligonucleotide, whether occurring naturally as in a purified 
restriction digest or produced synthetically, which is capable of acting 
as a point of initiation of synthesis when placed under conditions in 
which synthesis of a primer extension product, which is
complementary to a nucleic acid strand, is induced, i.e., in the
presence of nucleotides and an inducing agent such as a DNA
polymerase and at a suitable temperature and pH. The primer may be
either single-stranded or double-stranded and must be sufficiently
long to prime the synthesis of the desired extension product in the
presence of the inducing agent. The exact length of the primer will
depend upon many factors, including temperature, source of primer
and use of the method. For example, for diagnostic applications,
depending on the complexity of the target sequence, the
oligonucleotide primer typically contains 15-25 or more nucleotides,
although it may contain fewer nucleotides.

The primers herein are selected to be "substantially" 
complementary to different strands of a particular target DNA 
sequence. This means that the primers must be sufficiently 
complementary to hybridize with their respective strands. Therefore,
the primer sequence need not reflect the exact sequence of the
template. For example, a non-complementary nucleotide fragment
may be attached to the 5' end of the primer, with the remainder of the
primer sequence being complementary to the strand. Alternatively,
non-complementary bases or longer sequences can be interspersed
into the primer, provided that the primer sequence has sufficient
complementarity with the sequence to hybridize therewith and
thereby form the template for the synthesis of the extension product. 
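The notion of a "substantially" complementary primer, including a non-complementary 5' tail and interspersed mismatches, may be sketched as follows. This is an illustration under simplifying assumptions: hybridization is reduced to counting mismatched bases, and the function names, the `tail_len` parameter and the `max_mismatches` threshold are hypothetical and are not conditions taken from the specification.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def primer_binds(primer, template_strand, tail_len=0, max_mismatches=0):
    """True if the primer (5' to 3'), ignoring a 5' tail of tail_len bases,
    pairs with some site on template_strand (5' to 3') with at most
    max_mismatches mismatched bases."""
    core = primer.upper()[tail_len:]
    if not core:
        return False
    # A primer anneals where it equals the reverse complement of the template.
    target = reverse_complement(template_strand)
    for i in range(len(target) - len(core) + 1):
        window = target[i:i + len(core)]
        if sum(a != b for a, b in zip(core, window)) <= max_mismatches:
            return True
    return False
```

In practice the tolerated number of mismatches depends on primer length, temperature and salt concentration, none of which is modeled here.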

A cell has been "transformed" by exogenous or 
heterologous DNA when such DNA has been introduced inside the cell.
The transforming DNA may or may not be integrated (covalently
linked) into the genome of the cell. In prokaryotes, yeast, and
mammalian cells for example, the transforming DNA may be
maintained on an episomal element such as a plasmid. With respect
to eukaryotic cells, a stably transformed cell is one in which the
transforming DNA has become integrated into a chromosome so that
it is inherited by daughter cells through chromosome replication.
This stability is demonstrated by the ability of the eukaryotic cell to
establish cell lines or clones comprised of a population of daughter
cells containing the transforming DNA. A "clone" is a population of
cells derived from a single cell or ancestor by mitosis. A "cell line" is 
a clone of a primary cell that is capable of stable growth in vitro for 
many generations. 

Two DNA sequences are "substantially homologous" when 
at least about 75% (preferably at least about 80%, and most 
preferably at least about 90% or 95%) of the nucleotides match over 
the defined length of the DNA sequences. Sequences that are 
substantially homologous can be identified by comparing the 
sequences using standard software available in sequence data banks, 
or in a Southern hybridization experiment under, for example, 
stringent conditions as defined for that particular system. Defining 
appropriate hybridization conditions is within the skill of the art. See, 
e.g., Maniatis et al., supra; DNA Cloning, Vols. I & II, supra; Nucleic 
Acid Hybridization, supra. 
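The percentage test in the definition of "substantially homologous" may be illustrated with a simple identity calculation. The sketch assumes the two sequences have already been aligned to the same defined length without gaps; alignment itself, and the hybridization alternative, are outside its scope. The function names are hypothetical.

```python
def percent_identity(seq_a, seq_b):
    """Percentage of matching nucleotides between two pre-aligned,
    equal-length sequences (gaps are not handled in this sketch)."""
    if not seq_a or len(seq_a) != len(seq_b):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(seq_a.upper(), seq_b.upper()))
    return 100.0 * matches / len(seq_a)


def substantially_homologous(seq_a, seq_b, threshold=75.0):
    """Apply the 'at least about 75%' criterion; pass 80, 90 or 95 for the
    preferred levels."""
    return percent_identity(seq_a, seq_b) >= threshold
```

For instance, two aligned 8-nucleotide sequences differing at one position are 87.5% identical and meet the 75% and 80% levels but not the 90% level.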

A "heterologous" region of the DNA construct is an
identifiable segment of DNA within a larger DNA molecule that is not 
found in association with the larger molecule in nature. Thus, when 
the heterologous region encodes a mammalian gene, the gene will 
usually be flanked by DNA that does not flank the mammalian 
genomic DNA in the genome of the source organism. In another 
example, the coding sequence is a construct where the coding sequence
itself is not found in nature (e.g., a cDNA where the genomic coding 
sequence contains introns, or synthetic sequences having codons 
different than the native gene). Allelic variations or naturally- 
occurring mutational events do not give rise to a heterologous region 
of DNA as defined herein. 

The labels most commonly employed for these studies are
radioactive elements, enzymes, chemicals which fluoresce when
exposed to ultraviolet light, and others. A number of fluorescent
materials are known and can be utilized as labels. These include, for 
example, fluorescein, rhodamine, auramine, Texas Red, AMCA blue 
and Lucifer Yellow. A particular detecting material is anti-rabbit 
antibody prepared in goats and conjugated with fluorescein through 
an isothiocyanate. 

Proteins can also be labeled with a radioactive element or 
with an enzyme. The radioactive label can be detected by any of the 
currently available counting procedures. The preferred isotope may 
be selected from ³H, ¹⁴C, ³²P, ³⁵S, ³⁶Cl, ⁵¹Cr, ⁵⁷Co, ⁵⁸Co, ⁵⁹Fe, ⁹⁰Y, ¹²⁵I,
¹³¹I, and ¹⁸⁶Re.

Enzyme labels are likewise useful, and can be detected by 
any of the presently utilized colorimetric, spectrophotometric,
fluorospectrophotometric, amperometric or gasometric techniques.
The enzyme is conjugated to the selected particle by reaction with 
bridging molecules such as carbodiimides, diisocyanates, 
glutaraldehyde and the like. Many enzymes which can be used in 
these procedures are known and can be utilized. The preferred are 
peroxidase, β-glucuronidase, β-D-glucosidase, β-D-galactosidase,
urease, glucose oxidase plus peroxidase and alkaline phosphatase. 
U.S. Patent Nos. 3,654,090, 3,850,752, and 4,016,043 are referred to 
by way of example for their disclosure of alternate labeling material 
and methods. 

As used herein, the term "host" is meant to include not 
only prokaryotes but also eukaryotes such as yeast, plant and animal 
cells. A recombinant DNA molecule or gene which encodes a 28-kDa 
immunoreactive protein of Ehrlichia canis of the present invention can
be used to transform a host using any of the techniques commonly
known to those of ordinary skill in the art. Especially preferred is the 
use of a vector containing coding sequences for a gene encoding a 28- 
kDa immunoreactive protein of Ehrlichia canis of the present 
invention for purposes of prokaryote transformation. 

Prokaryotic hosts may include E. coli, S. typhimurium,
Serratia marcescens and Bacillus subtilis. Eukaryotic hosts include
yeasts such as Pichia pastoris, mammalian cells and insect cells.

In general, expression vectors containing promoter 
sequences which facilitate the efficient transcription of the inserted 
DNA fragment are used in connection with the host. The expression 
vector typically contains an origin of replication, promoter(s), 
terminator(s), as well as specific genes which are capable of providing
phenotypic selection in transformed cells. The transformed hosts can 
be fermented and cultured according to means known in the art to 
achieve optimal cell growth. 

The invention includes a substantially pure DNA encoding 
a 28-kDa immunoreactive protein of Ehrlichia canis, a strand of which 
DNA will hybridize at high stringency to a probe containing a 
sequence of at least 15 consecutive nucleotides of SEQ ID No. 1 or SEQ 
ID No. 3 or SEQ ID No. 5. The protein encoded by the DNA of this 
invention may share at least 80% sequence identity (preferably 85%, 
more preferably 90%, and most preferably 95%) with the amino acids 
listed in SEQ ID No. 2 or SEQ ID No. 4 or SEQ ID No. 6. More 
preferably, the DNA includes the coding sequence of the nucleotides 
of SEQ ID No. 1 or SEQ ID No. 3 or SEQ ID No. 5, or a degenerate 
variant of such a sequence. 

The probe to which the DNA of the invention hybridizes 
preferably consists of a sequence of at least 20 consecutive 
nucleotides, more preferably 40 nucleotides, even more preferably 50
nucleotides, and most preferably 100 nucleotides or more (up to
100%) of the coding sequence of the nucleotides listed in SEQ ID No. 1
or SEQ ID No. 3 or SEQ ID No. 5 or the complement thereof. Such a
probe is useful for detecting expression of the 28-kDa
immunoreactive protein of Ehrlichia canis in a human cell by a
method including the steps of (a) contacting mRNA obtained from the 
cell with the labeled hybridization probe; and (b) detecting 
hybridization of the probe with the mRNA. 

This invention also includes a substantially pure DNA 
containing a sequence of at least 15 consecutive nucleotides
(preferably 20, more preferably 30, even more preferably 50, and
most preferably all) of the region from the nucleotides listed in SEQ ID
No. 1 or SEQ ID No. 3 or SEQ ID No. 5.

By "high stringency" is meant DNA hybridization and wash 
conditions characterized by high temperature and low salt
concentration, e.g., wash conditions of 65°C at a salt concentration of
approximately 0.1 x SSC, or the functional equivalent thereof. For
example, high stringency conditions may include hybridization at
about 42°C in the presence of about 50% formamide; a first wash at
about 65°C with about 2 x SSC containing 1% SDS; followed by a
second wash at about 65°C with about 0.1 x SSC.

By "substantially pure DNA" is meant DNA that is not part 
of a milieu in which the DNA naturally occurs, by virtue of separation 
(partial or total purification) of some or all of the molecules of that 
milieu, or by virtue of alteration of sequences that flank the claimed
DNA. The term therefore includes, for example, a recombinant DNA
which is incorporated into a vector, into an autonomously replicating
plasmid or virus; or into the genomic DNA of a prokaryote or
eukaryote; or which exists as a separate molecule (e.g., a cDNA or a
genomic or cDNA fragment produced by polymerase chain reaction
(PCR) or restriction endonuclease digestion) independent of other
sequences. It also includes a recombinant DNA which is part of a
hybrid gene encoding additional polypeptide sequence, e.g., a fusion
protein. Also included is a recombinant DNA which includes a portion
of the nucleotides listed in SEQ ID No. 1 or SEQ ID No. 3 or SEQ ID No.
5 which encodes an alternative splice variant of a gene encoding a
28-kDa immunoreactive protein of Ehrlichia canis.

The DNA may have at least about 70% sequence identity to 
the coding sequence of the nucleotides listed in SEQ ID No. 1 or SEQ ID 
No. 3 or SEQ ID No. 5, preferably at least 75% (e.g., at least 80%); and
most preferably at least 90%. The identity between two sequences is a
direct function of the number of matching or identical positions.
When a subunit position in both of the two sequences is occupied by
the same monomeric subunit, e.g., if a given position is occupied by
an adenine in each of two DNA molecules, then they are identical at
that position. For example, if 7 positions in a sequence
10 nucleotides in length are identical to the corresponding positions
in a second 10-nucleotide sequence, then the two sequences have 70%
sequence identity. The length of comparison sequences will generally
be at least 50 nucleotides, preferably at least 60 nucleotides, more
preferably at least 75 nucleotides, and most preferably
100 nucleotides. Sequence identity is typically measured using
sequence analysis software (e.g., Sequence Analysis Software Package
of the Genetics Computer Group, University of Wisconsin
Biotechnology Center, 1710 University Avenue, Madison, WI 53705).
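
The 70% figure in the worked example above follows from a simple position-by-position count. A minimal sketch in Python, assuming two already-aligned sequences of equal length; the function name and the two 10-nucleotide sequences are hypothetical illustrations, not sequences of the invention, and this is not the method used by the software package named above:

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percent of aligned positions occupied by the same nucleotide."""
    if len(seq_a) != len(seq_b) or not seq_a:
        raise ValueError("sequences must be non-empty and of equal length")
    # Count positions at which both sequences carry the same base.
    matches = sum(1 for a, b in zip(seq_a.upper(), seq_b.upper()) if a == b)
    return 100.0 * matches / len(seq_a)


# Hypothetical 10-nucleotide sequences that differ at the last 3 positions.
first = "ATGCATGCAT"
second = "ATGCATGGCC"
identity = percent_identity(first, second)  # 7 of 10 positions match
```

Real comparisons would first require an alignment step, which this sketch deliberately leaves out.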

The present invention comprises a vector comprising a
DNA sequence which encodes a 28-kDa
immunoreactive protein of Ehrlichia canis, wherein said vector is capable
of replication in a host and comprises, in operable linkage: a) an
origin of replication; b) a promoter; and c) a DNA sequence coding
for said protein. Preferably, the vector of the present invention
contains a portion of the DNA sequence shown in SEQ ID No. 1 or SEQ
ID No. 3 or SEQ ID No. 5.

A "vector" may be defined as a replicable nucleic acid 
construct, e.g., a plasmid or viral nucleic acid. Vectors may be used
to amplify and/or express nucleic acid encoding a 28-kDa
immunoreactive protein of Ehrlichia canis. An expression vector is a
replicable construct in which a nucleic acid sequence encoding a
polypeptide is operably linked to suitable control sequences capable
of effecting expression of the polypeptide in a cell. The need for such
control sequences will vary depending upon the cell selected and the
transformation method chosen. Generally, control sequences include
a transcriptional promoter and/or enhancer, suitable mRNA ribosomal
binding sites, and sequences which control the termination of
transcription and translation. Methods which are well known to those
skilled in the art can be used to construct expression vectors
containing appropriate transcriptional and translational control
signals. See, for example, the techniques described in Sambrook et al.,
1989, Molecular Cloning: A Laboratory Manual (2nd Ed.), Cold Spring
Harbor Press, N.Y. A gene and its transcription control sequences are
defined as being "operably linked" if the transcription control
sequences effectively control the transcription of the gene. Vectors of
the invention include, but are not limited to, plasmid vectors and viral
vectors. Preferred viral vectors of the invention are those derived
from retroviruses, adenovirus, adeno-associated virus, SV40 virus, or
herpes viruses.

By a "substantially pure protein" is meant a protein which 
has been separated from at least some of those components which 
naturally accompany it. Typically, the protein is substantially pure
when it is at least 60%, by weight, free from the proteins and other
naturally-occurring organic molecules with which it is naturally
associated in vivo. Preferably, the purity of the preparation is at least
75%, more preferably at least 90%, and most preferably at least 99%,
by weight. A substantially pure 28-kDa immunoreactive protein of
Ehrlichia canis may be obtained, for example, by extraction from a 
natural source; by expression of a recombinant nucleic acid encoding 
a 28-kDa immunoreactive protein of Ehrlichia canis; or by chemically 
synthesizing the protein. Purity can be measured by any appropriate 
method, e.g., column chromatography such as immunoaffinity 
chromatography using an antibody specific for a 28-kDa 
immunoreactive protein of Ehrlichia canis, polyacrylamide gel 
electrophoresis, or HPLC analysis. A protein is substantially free of 
naturally associated components when it is separated from at least 
some of those contaminants which accompany it in its natural state. 
Thus, a protein which is chemically synthesized or produced in a 
cellular system different from the cell from which it naturally 
originates will be, by definition, substantially free from its naturally 
associated components. Accordingly, substantially pure proteins 
include eukaryotic proteins synthesized in E. coli, other prokaryotes, 
or any other organism in which they do not naturally occur. 

In addition to substantially full-length proteins, the 
invention also includes fragments (e.g., antigenic fragments) of the 
28-kDa immunoreactive protein of Ehrlichia canis (SEQ ID No. 2 or 
SEQ ID No. 4 or SEQ ID No. 6). As used herein, "fragment," as applied 
to a polypeptide, will ordinarily be at least 10 residues, more typically 
at least 20 residues, and preferably at least 30 (e.g., 50) residues in 
length, but less than the entire, intact sequence. Fragments of the 28- 
kDa immunoreactive protein of Ehrlichia canis can be generated by 
methods known to those skilled in the art, e.g., by enzymatic digestion 
of naturally occurring or recombinant 28-kDa immunoreactive 
protein of Ehrlichia canis, by recombinant DNA techniques using an
expression vector that encodes a defined fragment of 28-kDa
immunoreactive protein of Ehrlichia canis, or by chemical synthesis.
The ability of a candidate fragment to exhibit a characteristic of
28-kDa immunoreactive protein of Ehrlichia canis (e.g., binding to an
antibody specific for 28-kDa immunoreactive protein of Ehrlichia
canis) can be assessed by methods described herein. Purified 28-kDa
immunoreactive protein of Ehrlichia canis or antigenic fragments of
28-kDa immunoreactive protein of Ehrlichia canis can be used to
generate new antibodies or to test existing antibodies (e.g., as positive
controls in a diagnostic assay) by employing standard protocols
known to those skilled in the art. Included in this invention are
polyclonal antisera generated by using 28-kDa immunoreactive
protein of Ehrlichia canis or a fragment of 28-kDa immunoreactive
protein of Ehrlichia canis as the immunogen in, e.g., rabbits. Standard
protocols for monoclonal and polyclonal antibody production known
to those skilled in this art are employed. The monoclonal antibodies
generated by this procedure can be screened for the ability to identify
recombinant Ehrlichia canis cDNA clones, and to distinguish them
from known cDNA clones.

Further included in this invention are fragments of the 28- 
kDa immunoreactive protein of Ehrlichia canis which are encoded at 
least in part by portions of SEQ ID No. 1 or SEQ ID No. 3 or SEQ ID No. 
5, e.g., products of alternative mRNA splicing or alternative protein 
processing events, or in which a section of the sequence has been
deleted. The fragment, or the intact 28-kDa immunoreactive protein 
of Ehrlichia canis, may be covalently linked to another polypeptide, 
e.g. which acts as a label, a ligand or a means to increase antigenicity. 

The phrase "pharmaceutically acceptable" refers to
molecular entities and compositions that do not produce an allergic
or similar untoward reaction when administered to a human. The
preparation of an aqueous composition that contains a protein as an
active ingredient is well understood in the art. Typically, such
compositions are prepared as injectables, either as liquid solutions or
suspensions; solid forms suitable for solution in, or suspension in,
liquid prior to injection can also be prepared. The preparation can
also be emulsified.

A protein may be formulated into a composition in a 
neutral or salt form. Pharmaceutically acceptable salts include the
acid addition salts (formed with the free amino groups of the protein)
which are formed with inorganic acids such as, for example,
hydrochloric or phosphoric acids, or such organic acids as acetic,
oxalic, tartaric, mandelic, and the like. Salts formed with the free
carboxyl groups can also be derived from inorganic bases such as, for
example, sodium, potassium, ammonium, calcium, or ferric
hydroxides, and such organic bases as isopropylamine,
trimethylamine, histidine, procaine and the like.

Upon formulation, solutions will be administered in a 
manner compatible with the dosage formulation and in such amount 
as is therapeutically effective. The formulations are easily
administered in a variety of dosage forms such as injectable solutions. 

For parenteral administration in an aqueous solution, for 
example, the solution should be suitably buffered if necessary and the 
liquid diluent first rendered isotonic with sufficient saline or glucose. 
These particular aqueous solutions are especially suitable for
intravenous, intramuscular, subcutaneous and intraperitoneal
administration. In this connection, sterile aqueous media which can
be employed will be known to those of skill in the art in light of the
present disclosure. For example, one dosage could be dissolved in 1
mL of isotonic NaCl solution and either added to 1000 mL of
hypodermoclysis fluid or injected at the proposed site of infusion
(see, for example, "Remington's Pharmaceutical Sciences" 15th Edition,
pages 1035-1038 and 1570-1580). Some variation in dosage will
necessarily occur depending on the condition of the subject being
treated. The person responsible for administration will, in any event,
determine the appropriate dose for the individual subject.

As is well known in the art, a given polypeptide may vary 
in its immunogenicity. It is often necessary therefore to couple the 
immunogen (e.g., a polypeptide of the present invention) with a
carrier. Exemplary and preferred carriers are keyhole limpet
hemocyanin (KLH) and human serum albumin. Other carriers may 
include a variety of lymphokines and adjuvants such as IL2, IL4, IL8 
and others. 

Means for conjugating a polypeptide to a carrier protein
are well known in the art and include glutaraldehyde,
m-maleimidobenzoyl-N-hydroxysuccinimide ester, carbodiimide and
bis-diazotized benzidine. It is also understood that the peptide may
be conjugated to a protein by genetic engineering techniques that are 
well known in the art. 

As is also well known in the art, immunogenicity to a
particular immunogen can be enhanced by the use of non-specific
stimulators of the immune response known as adjuvants. Exemplary
and preferred adjuvants include complete BCG, Detox (RIBI
Immunochem Research Inc.), ISCOMS and aluminum hydroxide
adjuvant (Superphos, Biosector).

As used herein the term "complement" is used to define 
the strand of nucleic acid which will hybridize to the first nucleic acid 
sequence to form a double stranded molecule under stringent 
conditions. Stringent conditions are those that allow hybridization
between two nucleic acid sequences with a high degree of homology,
but preclude hybridization of random sequences. For example,
hybridization at low temperature and/or high ionic strength is termed
low stringency and hybridization at high temperature and/or low ionic
strength is termed high stringency. The temperature and ionic
strength of a desired stringency are understood to be applicable to
particular probe lengths, to the length and base content of the
sequences and to the presence of formamide in the hybridization
mixture.

As used herein, the term "engineered" or "recombinant" 
cell is intended to refer to a cell into which a recombinant gene, such
as a gene encoding an Ehrlichia chaffeensis antigen, has been
introduced. Therefore, engineered cells are distinguishable from
naturally occurring cells which do not contain a recombinantly
introduced gene. Engineered cells are thus cells having a gene or
genes introduced through the hand of man. Recombinantly
introduced genes will either be in the form of a cDNA gene, a copy of
a genomic gene, or will include genes positioned adjacent to a
promoter not naturally associated with the particular introduced
gene. In addition, the recombinant gene may be integrated into the
host genome, or it may be contained in a vector, or in a bacterial
genome transfected into the host cell.

The following examples are given for the purpose of 
illustrating various embodiments of the invention and are not meant 
to limit the present invention in any fashion. 

EXAMPLE 1 

Ehrlichiae and Purification

Ehrlichia canis (Florida strain and isolates Demon, DJ, 
Jake, and Fuzzy) were provided by Dr. Edward Breitschwerdt (College
of Veterinary Medicine, North Carolina State University, Raleigh, NC).
E. canis (Louisiana strain) was provided by Dr. Richard E. Corstvet
(School of Veterinary Medicine, Louisiana State University, Baton
Rouge, LA) and E. canis (Oklahoma strain) was provided by Dr.
Jacqueline Dawson (Centers for Disease Control and Prevention,
Atlanta, GA). Propagation of ehrlichiae was performed in DH82 cells
with DMEM supplemented with 10% bovine calf serum and 2 mM
L-glutamine at 37°C. The intracellular growth in DH82 cells was
monitored by presence of E. canis morulae using general cytologic
staining methods. Cells were harvested when 100% of the cells were
infected with ehrlichiae and were then pelleted in a centrifuge at
17,000 x g for 20 min. Cell pellets were disrupted with a Braun-Sonic
2000 sonicator twice at 40 W for 30 sec on ice. Ehrlichiae were
purified as described previously (Weiss et al., 1975). The lysate was
loaded onto discontinuous gradients of 42%-36%-30% renografin, and
centrifuged at 80,000 x g for 1 hr. Heavy and light bands containing
ehrlichiae were collected and washed with sucrose-phosphate-glutamate
buffer (SPG, 218 mM sucrose, 3.8 mM KH2PO4, 7.2 mM
K2HPO4, 4.9 mM glutamate, pH 7.0) and pelleted by centrifugation.

EXAMPLE 2

Nucleic Acid Preparation

Ehrlichia canis genomic DNA was prepared by 
resuspending the renografin-purified ehrlichiae in 600 µl of 10 mM
Tris-HCl buffer (pH 7.5) with 1% sodium dodecyl sulfate (SDS, w/v)
and 100 ng/ml of proteinase K as described previously (McBride et al.,
1996). This mixture was incubated for 1 hr at 56°C, and the nucleic
acids were extracted twice with a mixture of
phenol/chloroform/isoamyl alcohol (24:24:1). DNA was pelleted by
absolute ethanol precipitation, washed once with 70% ethanol, dried
and resuspended in 10 mM Tris (pH 7.5). Plasmid DNA was purified by
using High Pure Plasmid Isolation Kit (Boehringer Mannheim,
Indianapolis, IN), and PCR products were purified using a QIAquick PCR
Purification Kit (Qiagen, Santa Clarita, CA).

EXAMPLE 3

PCR Amplification of the E. canis 28-kDa Protein Genes

Regions of the E. canis ECa28-1 gene selected for PCR
amplification were chosen based on homology observed (>90%) in the
consensus sequence generated from Jotun-Hein algorithm alignment
of E. chaffeensis p28 and Cowdria ruminantium map-1 genes. Forward
primer 793 (5'-GCAGGAGCTGTTGGTTACTC-3') (SEQ ID NO. 16) and
reverse primer 1330 (5'-CCTTCCTCCAAGTTCTATGCC-3') (SEQ ID NO.
17) corresponded to nucleotides 313-332 and 823-843 of C.
ruminantium MAP-1 and 307-326 and 834-814 of E. chaffeensis P28.
E. canis (a North Carolina isolate, Jake) DNA was amplified with
primers 793 and 1330 with a thermal cycling profile of 95°C for 2
min, and 30 cycles of 95°C for 30 sec, 62°C for 1 min, 72°C for 2 min
followed by a 72°C extension for 10 min and 4°C hold. PCR products
were analyzed on 1% agarose gels. This amplified PCR product was
sequenced directly with primers 793 and 1330.
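
The cycling profile above can be written down as a small data structure, which makes the programmed hold time easy to check. A sketch in Python; the class, field, and function names are hypothetical, ramp times between temperatures are ignored, and the open-ended 4°C hold is left out:

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    temp_c: float  # block temperature in degrees Celsius
    seconds: int   # programmed hold time at that temperature


# Profile stated in the text for primers 793 and 1330.
initial_denaturation = Step(95, 120)
cycle = [Step(95, 30), Step(62, 60), Step(72, 120)]
n_cycles = 30
final_extension = Step(72, 600)


def programmed_seconds() -> int:
    """Sum of all programmed holds, excluding ramping and the final 4 C hold."""
    per_cycle = sum(step.seconds for step in cycle)
    return (
        initial_denaturation.seconds
        + n_cycles * per_cycle
        + final_extension.seconds
    )


total_minutes = programmed_seconds() / 60
```

Actual run time on an instrument would be longer, since it depends on ramp rates that the text does not specify.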

Primers specific for ECa28SA2 gene designated 46f (5'-
ATATACTTCCTACCTAATGTCTCA-3', SEQ ID No. 18) and primer 1330
(SEQ ID No. 17) were used to amplify the targeted region. The
amplified product was gel purified and cloned into a TA cloning vector
(Invitrogen, Santa Clarita, CA). The clone was sequenced
bidirectionally with primers: M13 reverse from the vector, 46f,
ECa28SA2 (5'-ACn'GCAGAGTCTTCGGTTTC-3', SEQ ID No. 19), and ECa5.3
(5'-GTTACTTGCGGAGGACAT-3', SEQ ID No. 20). DNA was amplified
with a thermal cycling profile of 95°C for 2 min, and 30 cycles of 95°C
for 30 sec, 48°C for 1 min, 72°C for 1 min followed by a 72°C
extension for 10 min and 4°C hold.

EXAMPLE 4

Sequencing Unknown 5' and 3' Regions of the ECa28-1 Gene

The full length sequence of ECa28-1 was determined using
a Universal GenomeWalker Kit (CLONTECH, Palo Alto, CA) according to
the protocol supplied by the manufacturer. Genomic E. canis (Jake
isolate) DNA was digested completely with five restriction enzymes
(DraI, EcoRV, PvuII, ScaI, StuI) which produce blunt-ended DNA. An
adapter (AP1) supplied in the kit was ligated to each end of E. canis
DNA. The genomic libraries were used as templates to find the
unknown DNA sequence of the ECa28-1 gene by PCR using a primer
complementary to a known portion of the ECa28-1 sequence and a
primer specific for the adapter AP1. Primers specific for ECa28-1 used
for genome walking were designed from the known DNA sequence
derived from PCR amplification of ECa28-1 with primers 793 (SEQ ID
NO. 16) and 1330 (SEQ ID NO. 17). Primers 394 (5'-
GCATTTCCACAGGATCATAGGTAA-3'; nucleotides 687-710, SEQ ID NO.
21) and 394C (5'-TTACCTATGATCCTGTGGAAATGC-3'; nucleotides
710-687, SEQ ID NO. 22) were used in conjunction with supplied
primer AP1 to amplify the unknown 5' and 3' regions of the ECa28-1
gene by PCR. A PCR product corresponding to the 5' region of the
ECa28-1 gene amplified with primers 394C and AP1 (2000-bp) was
sequenced unidirectionally with primer 793C
(5'-GAGTAACCAACAGCTCCTGC-3', SEQ ID No. 23). A PCR product corresponding
to the 3' region of the ECa28-1 gene amplified with primers 394 and
AP1 (580-bp) was sequenced bidirectionally with the same primers.
Noncoding regions on the 5' and 3' regions adjacent to the open
reading frame were sequenced, and primers EC280M-F (5'-
TCTACTTTGCACTTCCACTATTGT-3', SEQ ID NO. 24) and EC280M-R (5'-
ATTCTTTTGCCACTATTTTTCTTT-3', SEQ ID NO. 25) complementary to
these regions were designed in order to amplify the entire ECa28-1
gene.
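
Primers 394C and 793C are the reverse complements of primers 394 and 793, which is consistent with the nucleotide ranges given for 394 (687-710) and 394C (710-687). A short Python check of that relationship, using the primer sequences as given in this example; the helper name is an illustration only:

```python
# Translation table mapping each base to its Watson-Crick partner.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


primer_394 = "GCATTTCCACAGGATCATAGGTAA"   # SEQ ID NO. 21
primer_394c = "TTACCTATGATCCTGTGGAAATGC"  # SEQ ID NO. 22
primer_793 = "GCAGGAGCTGTTGGTTACTC"       # SEQ ID NO. 16
primer_793c = "GAGTAACCAACAGCTCCTGC"      # SEQ ID NO. 23

pairs_match = (
    reverse_complement(primer_394) == primer_394c
    and reverse_complement(primer_793) == primer_793c
)
```

Because each pair anneals to opposite strands at the same site, one member of a pair extends toward the 5' region and the other toward the 3' region when combined with the adapter primer.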

EXAMPLE 5

Sequencing of E. canis Isolates

DNA was sequenced with an ABI Prism 377 DNA Sequencer
(Perkin-Elmer Applied Biosystems, Foster City, CA). The entire
ECa28-1 genes of seven E. canis isolates (four from North Carolina, and one
each from Oklahoma, Florida, and Louisiana) were amplified by PCR
with primers EC280M-F (SEQ ID No. 24) and EC280M-R (SEQ ID No.
25) with a thermal cycling profile of 95°C for 5 minutes, and 30 cycles
of 95°C for 30 seconds, 62°C for 1 minute, and 72°C for 2 minutes
and a 72°C extension for 10 minutes. The resulting PCR products were
bidirectionally sequenced with the same primers.

EXAMPLE 6 

Cloning and Expression of E. canis ECa28-1

The entire E. canis ECa28-1 gene was PCR-amplified with
primers EC280M-F and EC280M-R and cloned into pCR2.1-TOPO TA
cloning vector to obtain the desired set of restriction enzyme cleavage
sites (Invitrogen, Carlsbad, CA). The insert was excised from
pCR2.1-TOPO with BstX I and ligated into pcDNA 3.1 eukaryotic expression
vector (Invitrogen, Carlsbad, CA) designated pcDNA3.1/EC28 for
subsequent studies. The pcDNA3.1/EC28 plasmid was amplified, and
the gene was excised with a KpnI-XbaI double digestion and
directionally ligated into pThioHis prokaryotic expression vector
(Invitrogen, Carlsbad, CA). The clone (designated pThioHis/EC28)
produced a recombinant thioredoxin fusion protein in Escherichia coli
BL21. The recombinant fusion protein was crudely purified in the
insoluble phase by centrifugation. The control thioredoxin fusion
protein was purified from soluble cell lysates under native conditions
using nickel-NTA spin columns (Qiagen, Santa Clarita, CA).



EXAMPLE 7

Western Immunoblot Analysis

Recombinant E. canis ECa28-1 fusion protein was subjected
to SDS-polyacrylamide gel electrophoresis (SDS-PAGE) on 4-15%
Tris-HCl gradient gels (Bio-Rad, Hercules, CA) and transferred to pure
nitrocellulose (Schleicher & Schuell, Keene, NH) using a semi-dry
transfer cell (Bio-Rad, Hercules, CA). The membrane was incubated
with convalescent phase antisera from an E. canis-infected dog diluted
1:5000 for 1 hour, washed, and then incubated with an anti-canine IgG
(H & L) alkaline phosphatase-conjugated affinity-purified secondary
antibody at 1:1000 for 1 hour (Kirkegaard & Perry Laboratories,
Gaithersburg, MD). Bound antibody was visualized with
5-bromo-4-chloro-3-indolyl phosphate/nitroblue tetrazolium (BCIP/NBT)
substrate (Kirkegaard & Perry Laboratories, Gaithersburg, MD).

EXAMPLE 8

Southern Blot Analysis

To determine if multiple genes homologous to the ECa28-1
gene were present in the E. canis genome, a genomic Southern blot
analysis was performed using a standard procedure (Sambrook et al.,
1989). E. canis genomic DNA was digested completely with each of the
restriction enzymes BanII, EcoRV, HaeII, KpnI and SpeI, which do not
cut within the ECa28-1 gene, and AseI, which digests ECa28-1 at
nucleotides 34, 43 and 656. The probe was produced by PCR
amplification with primers EC280M-F and EC280M-R and digoxigenin
(DIG)-labeled deoxynucleotide triphosphates (dNTPs) (Boehringer
Mannheim, Indianapolis, IN) and digested with AseI. The digested
probe (566-bp) was separated by agarose gel electrophoresis,
gel-purified and then used for hybridization. The completely digested
genomic E. canis DNA was electrophoresed and transferred to a nylon
membrane (Boehringer Mannheim, Indianapolis, IN) and hybridized at
40°C for 16 hr with the ECa28-1 gene DIG-labeled probe in DIG Easy
Hyb buffer according to the manufacturer's protocol (Boehringer
Mannheim, Indianapolis, IN). Bound probe was detected with an
anti-DIG alkaline phosphatase-conjugated antibody and a luminescent
substrate (Boehringer Mannheim, Indianapolis, IN) and exposed to
BioMax scientific imaging film (Eastman Kodak, Rochester, NY).

EXAMPLE 9

Sequence Analysis and Comparison

E. chaffeensis p28 and C. ruminantium map-1 DNA
sequences were obtained from the National Center for Biotechnology
Information (NCBI) (World Wide Web site at URL:
http://www.ncbi.nlm.nih.gov/Entrez). Nucleotide and deduced amino
acid sequences, and protein and phylogenetic analyses were
performed with LASERGENE software (DNASTAR, Inc., Madison, WI).
Analysis of post-translational processing was performed by the
method of McGeoch and von Heijne for signal sequence recognition
using the PSORT program (McGeoch, 1985; von Heijne, 1986) (World
Wide Web site at URL:
http://www.imcb.osaka-u.ac.jp/nakai/form.htm).

GenBank accession numbers for nucleic acid and amino 
acid sequences of the E. canis ECa28-1 genes described in this study
are: Jake, AF082744; Louisiana, AF082745; Oklahoma, AF082746; 
Demon, AF082747; DJ, AF082748; Fuzzy, AF082749; Florida, 
AF082750. 

Sequence analysis of ECa28-1 from seven different strains
of E. canis was performed with primers designed to amplify the entire 
gene. Analysis revealed the sequence of this gene was conserved 
among the isolates from North Carolina (four), Louisiana, Florida and 
Oklahoma. 

EXAMPLE 10

PCR Amplification, Cloning, Sequencing and Expression of ECa28-1

Alignment of nucleic acid sequences from E. chaffeensis
p28 and Cowdria ruminantium map-1 using the Jotun-Hein algorithm
produced a consensus sequence with regions of high homology
(>90%). These homologous regions (nucleotides 313-332 and 823-843
of C. ruminantium map-1; 307-326 and 814-834 of E. chaffeensis
p28) were targeted as primer annealing sites for PCR amplification.
PCR amplification of the E. canis ECa28-1 and E. chaffeensis p28 gene
was accomplished with primers 793 and 1330, resulting in a 518-bp
PCR product. The nucleic acid sequence of the E. canis PCR product
was obtained by sequencing the product directly with primers 793 and
1330. Analysis of the sequence revealed an open reading frame
encoding a protein of 170 amino acids, and alignment of the 518-bp
sequence obtained from PCR amplification of E. canis with the DNA
sequence of E. chaffeensis p28 gene revealed a similarity greater than
70%, indicating that the genes were homologous. Adapter PCR with
primers 394 and 793C was performed to determine the 5' and 3'
segments of the sequence of the entire gene. Primer 394 produced
four PCR products (3-kb, 2-kb, 1-kb, and 0.8-kb), and the 0.8-kb
product was sequenced bidirectionally using primers 394 and AP1.
The deduced sequence overlapped with the 3' end of the 518-bp
product, extending the open reading frame 12-bp to a termination
codon. An additional 625-bp of non-coding sequence at the 3' end of
the ECa28-1 gene was also sequenced. Primer 394C was used to
amplify the 5' end of the ECa28-1 gene with supplied primer AP1.
Amplification with these primers resulted in three PCR products
(3.3-kb, 3-kb, and 2-kb). The 2-kb fragment was sequenced unidirectionally
with primer 793C. The sequence provided the putative start codon of
the ECa28-1 gene and completed the 834-bp open reading frame
encoding a protein of 278 amino acids. An additional 144-bp of
readable sequence in the 5' noncoding region of the ECa28-1 gene was
generated. Primers EC280M-F and EC280M-R were designed from
complementary non-coding regions adjacent to the ECa28-1 gene.
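
The reading-frame length and protein length reported above are consistent at three nucleotides per codon. A quick arithmetic check in Python; the variable names are illustrative, and since the text does not state whether the 834-bp figure counts the termination codon, the check only confirms the division:

```python
orf_bp = 834       # open reading frame length reported in the text
protein_aa = 278   # protein length reported in the text

# A complete reading frame must be a whole number of codons.
assert orf_bp % 3 == 0
codons = orf_bp // 3
lengths_agree = codons == protein_aa
```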

The PCR product amplified with these primers was
sequenced directly with the same primers. The complete DNA
sequence (SEQ ID NO. 1) for the E. canis ECa28-1 gene is shown in
Figure 1. The ECa28-1 PCR fragment amplified with these primers
contained the entire open reading frame and 17 additional amino
acids from the 5' non-coding primer region. The gene was
directionally subcloned into pThioHis expression vector, and E. coli
(BL21) were transformed with this construct. The expressed
ECa28-1-thioredoxin fusion protein was insoluble. The expressed protein had
an additional 114 amino acids associated with the thioredoxin, 5
amino acids for the enterokinase recognition site, and 32 amino acids
from the multiple cloning site and 5' non-coding primer region at the
N-terminus. Convalescent-phase antiserum from an E. canis-infected
dog recognized the expressed recombinant fusion protein, but did not
react with the thioredoxin control (Figure 2).

EXAMPLE 11

Sequence Homology 

The nucleic acid sequence of ECa28-1 (834-bp) and the E.
chaffeensis omp-1 family of genes including signal sequences (ECa28-1,
omp-1A, B, C, D, E, and F) were aligned using the Clustal method to
examine homology between these genes (alignment not shown).
Nucleic acid homology was equally conserved (68.9%) between
ECa28-1, and E. chaffeensis p28 and omp-1F. Other putative outer
membrane protein genes in the E. chaffeensis omp-1 family, omp-1D
(68.2%), omp-1E (66.7%), omp-1C (64.1%), Cowdria ruminantium
map-1 (61.8%), E. canis 28-kDa protein 1 gene (60%) and 28-kDa
protein 2 gene (partial) (59.5%) were also homologous to ECa28-1. E.
chaffeensis omp-1B had the least nucleic acid homology (45.1%) with
ECa28-1.

Alignment of the predicted amino acid sequences of 
ECa28-1 (SEQ ID NO. 2) and E. chaffeensis P28 revealed amino acid
substitutions resulting in four variable regions (VR). Substitutions or
deletions in the amino acid sequence and the locations of variable
regions of ECa28-1 and the E. chaffeensis OMP-1 family were identified
(Figure 3). Amino acid comparison including the signal peptide
revealed that ECa28-1 shared the most homology with OMP-1F (68%)
of the E. chaffeensis OMP-1 family, followed by E. chaffeensis P28
(65.5%), OMP-1E (65.1%), OMP-1D (62.9%), OMP-1C (62.9%),
Cowdria ruminantium MAP-1 (59.4%), E. canis 28-kDa protein 1
(55.6%) and 28-kDa protein 2 (partial) (53.6%), and OMP-1B
(43.2%). The phylogenetic relationships based on amino acid
sequences show that ECa28-1 and C. ruminantium MAP-1, E.
chaffeensis OMP-1 proteins, and E. canis 28-kDa proteins 1 and 2
(partial) are related (Figure 4).

EXAMPLE 12

Predicted Surface Probability and Immunoreactivity

Analysis of E. canis ECa28-1 using hydropathy and
hydrophilicity profiles predicted surface-exposed regions on ECa28-1
(Figure 6). Eight major surface-exposed regions consisting of 3 to 9
amino acids were identified on ECa28-1 and were similar to the profile
of surface-exposed regions on E. chaffeensis P28 (Figure 6). Five of
the larger surface-exposed regions on ECa28-1 were located in the
N-terminal region of the protein. Surface-exposed hydrophilic regions
were found in all four of the variable regions of ECa28-1. Ten T-cell
motifs were predicted in ECa28-1 using the Rothbard-Taylor
algorithm (Rothbard and Taylor, 1988), and high antigenicity of
ECa28-1 was predicted by the Jameson-Wolf antigenicity algorithm
(Figure 6) (Jameson and Wolf, 1988). Similarities in antigenicity and
T-cell motifs were observed between ECa28-1 and E. chaffeensis P28.

EXAMPLE 13

Detection of Homologous Genomic Copies of ECa28-1 Gene

Genomic Southern blot analysis of E. canis DNA completely
digested independently with restriction enzymes BanII, EcoRV, HaeII,
KpnI, SpeI, which do not have restriction endonuclease sites in the
ECa28-1 gene, and AseI, which has internal restriction endonuclease
sites at nucleotides 34, 43 and 656, revealed the presence of at least
three homologous ECa28-1 gene copies (Figure 5). Although ECa28-1
has internal AseI restriction sites, the DIG-labeled probe used
in the hybridization experiment targeted a region of the gene within a
single DNA fragment generated by the AseI digestion of the gene.
Digestion with AseI produced 3 bands (approximately 566-bp, 850-bp,
and 3-kb) that hybridized with the ECa28-1 DNA probe, indicating
the presence of multiple genes homologous to ECa28-1 in the genome.
Digestion with EcoRV and SpeI produced two bands that hybridized
with the ECa28-1 gene probe.

EXAMPLE 14

Identification of 28-kDa Protein Gene Locus

Specific primers designated ECaSA3-2
(5'-CTAGGATTAGGTTATAGTATAAGTT-3', SEQ ID No. 26) corresponding to regions
within ECa28SA3 and primer 793C (SEQ ID No. 23) which anneals to a
region within ECa28-1 were used to amplify the intergenic region
between gene SA3 and ECa28-1. The 800-bp product was sequenced
with the same primers. DNA was amplified with a thermal cycling
profile of 95°C for 2 min, and 30 cycles of 95°C for 30 sec, 50°C for 1
min, 72°C for 1 min followed by a 72°C extension for 10 min and 4°C
hold.
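The cycling conditions and primer above can be written down as data, which makes the programmed run time and the primer composition easy to check. The following is a minimal sketch only: the step labels and function names are illustrative assumptions, ramp times are ignored, and the final 4°C hold is treated as open-ended.

```python
# Illustrative sketch: the thermal cycling profile from the text as data.
# Step labels and function names are hypothetical, not from the source.

PRIMER_ECASA3_2 = "CTAGGATTAGGTTATAGTATAAGTT"  # SEQ ID No. 26, as given in the text

# Each step is (label, temperature in deg C, duration in minutes).
INITIAL_DENATURATION = ("initial denaturation", 95, 2.0)
CYCLE_STEPS = [
    ("denaturation", 95, 0.5),
    ("annealing", 50, 1.0),
    ("extension", 72, 1.0),
]
N_CYCLES = 30
FINAL_EXTENSION = ("final extension", 72, 10.0)


def programmed_minutes():
    """Sum of programmed step times, excluding ramping and the 4 deg C hold."""
    per_cycle = sum(minutes for _, _, minutes in CYCLE_STEPS)
    return INITIAL_DENATURATION[2] + N_CYCLES * per_cycle + FINAL_EXTENSION[2]


def gc_fraction(primer):
    """Fraction of G and C bases in a primer sequence."""
    primer = primer.upper()
    return sum(base in "GC" for base in primer) / len(primer)


if __name__ == "__main__":
    print(f"programmed time: {programmed_minutes():.0f} min")
    print(f"primer length: {len(PRIMER_ECASA3_2)} nt")
    print(f"primer GC content: {gc_fraction(PRIMER_ECASA3_2):.0%}")
```

The low GC content of the primer is in keeping with the relatively low 50°C annealing step, although the source does not state how the annealing temperature was chosen.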

EXAMPLE 15

PCR Amplification of 28-kDa Protein Genes and Identification of the
Multiple Gene Locus

In order to specifically amplify possible unknown genes
downstream of ECa28SA2, primer 46f specific for ECa28SA2, and
primer 1330 which targets a conserved region on the 3' end of the
ECa28-1 gene were used for amplification. A 2-kb PCR product was
amplified with these primers that contained 2 open reading frames.
The first open reading frame contained the known region of gene
ECa28SA2 and a previously unsequenced 3' portion of the gene.
Downstream from ECa28SA2 an additional nonidentical, but homologous
28-kDa protein gene was found, and designated ECa28SA3. The two known
loci were joined by amplification with primer SA3-2, specific for the
3' end of the ECa28SA3 gene, used in conjunction with a reverse primer
793C, which anneals at the 5' end of ECa28-1. An 800-bp PCR product was
amplified which contained the 3' end of ECa28SA3, the intergenic
region between ECa28SA3 and ECa28-1 (28NC3) and the 5' end of
ECa28-1, joining the previously separate loci (Figure 8). The 849-bp
open reading frame of ECa28SA2 encodes a 283 amino acid protein,
and ECa28SA3 has an 840-bp open reading frame encoding a 280
amino acid protein. The intergenic noncoding region between
ECa28SA3 and ECa28-1 was 345-bp in length (Figures 7 and 8).
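The relationship between the open reading frame lengths and the protein lengths quoted above can be checked with simple arithmetic. The sketch below assumes, consistent with the figures in the text, that the quoted lengths count coding nucleotides only and exclude the stop codon; the function name and its `includes_stop` parameter are illustrative.

```python
# Illustrative check of the ORF lengths quoted in the text.
# Assumption: the quoted lengths exclude the stop codon.

ORF_LENGTHS_BP = {"ECa28SA2": 849, "ECa28SA3": 840}


def residues_from_orf(length_bp, includes_stop=False):
    """Number of encoded amino acids for an ORF of the given length."""
    if length_bp % 3 != 0:
        raise ValueError("ORF length is not a multiple of three")
    codons = length_bp // 3
    # A stop codon occupies one codon but encodes no residue.
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    for gene, length_bp in ORF_LENGTHS_BP.items():
        print(gene, length_bp, "bp ->", residues_from_orf(length_bp), "amino acids")
```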

EXAMPLE 16

Nucleic and Amino Acid Homology

The nucleic and amino acid sequences of all five E. canis
28-kDa protein genes were aligned using the Clustal method to
examine the homology between these genes. The nucleic acid
homology ranged from 58 to 75% and a similar amino acid homology
ranging from 67 to 72% was observed between the E. canis 28-kDa
protein gene members (Figure 9).
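Percentages of this kind are derived from aligned sequences. As a rough illustration only, the sketch below computes percent identity over the ungapped columns of a pairwise alignment. It is not the Clustal procedure, and the homology and divergence values reported in this specification may have been calculated differently. The two example strings are the first 23 residues of ECa28SA3 (as shown in FIG. 7-2) and of ECa28-1 (as listed in SEQ ID NO. 2); the function name is hypothetical.

```python
# Illustrative sketch: percent identity over an existing pairwise alignment.
# This is not the Clustal method; it only counts identical aligned positions.

ECA28SA3_N_TERM = "MNCKKILITTALMSLMYYAPSIS"  # residues 1-23, read from FIG. 7-2
ECA28_1_N_TERM = "MNCKKILITTALISLMYSIPSIS"   # residues 1-23, read from SEQ ID NO. 2


def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of identical residues among columns where neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a, aligned_b):
        if x == gap or y == gap:
            continue  # skip gapped columns
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


if __name__ == "__main__":
    value = percent_identity(ECA28SA3_N_TERM, ECA28_1_N_TERM)
    print(f"N-terminal identity: {value:.1f}%")
```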

EXAMPLE 17

Transcriptional Promoter Regions

The intergenic regions between the 28-kDa protein genes
were analyzed for promoter sequences by comparison with consensus
Escherichia coli promoter regions and a promoter from E. chaffeensis
(Yu et al., 1997; McClure, 1985).

Putative promoter sequences including RBS, -10 and -35
regions were identified in 4 intergenic sequences corresponding to
genes ECa28SA2, ECa28SA3, ECa28-1, and ECa28-2 (Figure 10). The
upstream noncoding region of ECa28SA1 is not known and was not
analyzed.
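The specification does not describe how the comparison with consensus promoter regions was carried out. Purely as an illustration of the idea, the sketch below scans a DNA string for hexamers resembling the E. coli sigma-70 consensus -35 (TTGACA) and -10 (TATAAT) elements separated by a spacer of roughly 17 bp. The mismatch tolerance, spacer range, and function names are assumptions made for this example, and the test sequence is synthetic rather than an E. canis intergenic region.

```python
# Illustrative sketch of a consensus-based promoter scan.
# Thresholds and names are assumptions, not taken from the specification.

MINUS_35 = "TTGACA"  # E. coli sigma-70 consensus -35 hexamer
MINUS_10 = "TATAAT"  # E. coli sigma-70 consensus -10 hexamer


def mismatches(a, b):
    """Count positions at which two equal-length strings differ."""
    return sum(x != y for x, y in zip(a, b))


def find_putative_promoters(seq, max_mismatch=2, spacers=(15, 16, 17, 18, 19)):
    """Return (pos35, pos10, spacer, total_mismatches) for candidate pairs.

    Positions are 0-based indices of the first base of each hexamer.
    """
    seq = seq.upper()
    hits = []
    for i in range(len(seq) - 5):
        m35 = mismatches(seq[i:i + 6], MINUS_35)
        if m35 > max_mismatch:
            continue
        for gap in spacers:
            j = i + 6 + gap
            if j + 6 > len(seq):
                break  # longer spacers would also run past the end
            m10 = mismatches(seq[j:j + 6], MINUS_10)
            if m10 <= max_mismatch:
                hits.append((i, j, gap, m35 + m10))
    return hits


if __name__ == "__main__":
    # Synthetic example: perfect -35 and -10 elements with a 17-bp spacer.
    toy = "GG" + MINUS_35 + "C" * 17 + MINUS_10 + "GG"
    print(find_putative_promoters(toy, max_mismatch=0))
```

A scan of this kind only nominates candidates; as noted elsewhere in this specification, the promoter sequences have not been determined experimentally.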

EXAMPLE 18

N-Terminal Signal Sequence

The amino acid sequence analysis revealed that the entire
E. canis ECa28-1 has a deduced molecular mass of 30.5-kDa and the
entire ECa28SA3 has a deduced molecular mass of 30.7-kDa. Both
proteins have a predicted N-terminal signal peptide of 23 amino acids
(MNCKKILITTALMSLMYYAPSIS, SEQ ID No. 27), which is similar to that
predicted for E. chaffeensis P28 (MNYKKEJTSALISLISSLPGVSFS, SEQ ID
NO. 28), and the OMP-1 protein family (Yu et al., 1998; Ohashi et al.,
1998b). A preferred cleavage site for signal peptidases (SIS; Ser-X-
Ser) (Oliver, 1985) is found at amino acids 21, 22, and 23 of ECa28-1.
An additional putative cleavage site at amino acid position 25
(MNCKKILITTALISLMYSIPSISSFS, SEQ ID NO. 29) identical to the
predicted cleavage site of E. chaffeensis P28 (SFS) was also present,
and would result in a mature ECa28-1 with a predicted molecular mass
of 27.7-kDa. The signal cleavage site of the previously reported partial
sequence of ECa28SA2 is predicted at amino acid 30. However, signal
sequence analysis predicted that ECa28SA1 had an uncleavable signal
sequence.
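The Ser-X-Ser positions described above can be located with a short pattern search. The sketch below is illustrative only: it uses the first 25 residues of ECa28-1 as read from the SEQ ID NO. 2 listing, the function name is hypothetical, and finding the motif is not a substitute for the signal sequence analysis referred to in the text.

```python
import re

# First 25 residues of ECa28-1, read from the SEQ ID NO. 2 listing.
ECA28_1_N_TERMINUS = "MNCKKILITTALISLMYSIPSISFS"


def ser_x_ser_sites(protein):
    """Return (start, motif) for every Ser-X-Ser tripeptide.

    Start positions are 1-based; a lookahead is used so that
    overlapping motifs (for example SIS followed by SFS) are all reported.
    """
    return [(m.start() + 1, m.group(1)) for m in re.finditer(r"(?=(S.S))", protein)]


if __name__ == "__main__":
    for start, motif in ser_x_ser_sites(ECA28_1_N_TERMINUS):
        print(f"{motif} at residues {start}-{start + 2}")
```

With this input the search reports SIS at residues 21 to 23 and SFS at residues 23 to 25, matching the two candidate cleavage sites described in the text.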
Summary 

Proteins of similar molecular mass have been identified
and cloned from multiple rickettsial agents including E. canis, E.
chaffeensis, and C. ruminantium (Reddy et al., 1998; Jongejan et al.,
1993; Ohashi et al., 1998). A single locus in Ehrlichia chaffeensis with
6 homologous p28 genes, and 2 loci in E. canis, each containing some
homologous 28-kDa protein genes, have been previously described.

The present invention demonstrated the cloning,
expression and characterization of genes encoding a mature 28-kDa
protein of E. canis that are homologous to the omp-1 multiple gene
family of E. chaffeensis and the C. ruminantium map-1 gene. Two new
28-kDa protein genes were identified, ECa28-1 and ECa28SA3.
Another E. canis 28-kDa protein gene, ECa28SA2, partially sequenced
previously (Reddy et al., 1998), was sequenced completely in the
present invention. Also disclosed is the identification and
characterization of a single locus in E. canis containing all five E. canis
28-kDa protein genes.

The E. canis 28-kDa proteins are homologous to the
E. chaffeensis OMP-1 family and the MAP-1 protein of C. ruminantium.
The most homologous E. canis 28-kDa proteins (ECa28SA3, ECa28-1
and ECa28-2) are sequentially arranged in the locus. Homology of
these proteins ranged from 67.5% to 72.3%. Divergence among these
28-kDa proteins was 27.3% to 38.6%. E. canis 28-kDa proteins
ECa28SA1 and ECa28SA2 were the least homologous with homology
ranging from 50.9% to 59.4% and divergence of 53.3 to 69.9%.
Differences between the genes lie primarily in the four hypervariable
regions, which suggests that these regions are surface exposed and
subject to selective pressure by the immune system. Conservation of
ECa28-1 among seven E. canis isolates has been reported (McBride et
al., 1999), suggesting that E. canis may be clonal in North America.
Conversely, significant diversity of p28 among E. chaffeensis isolates
has been reported (Yu et al., 1998).

All of the E. canis 28-kDa proteins appear to be
post-translationally processed from a 30-kD protein to a mature 28-kD
protein. Recently, a signal sequence was identified on E. chaffeensis
P28 (Yu et al., 1998), and N-terminal amino acid sequencing has
verified that the protein is post-translationally processed resulting in
cleavage of the signal sequence to produce a mature protein (Ohashi
et al., 1998). The leader sequences of OMP-1F and OMP-1E have also
been proposed as leader signal peptides (Ohashi et al., 1998). Signal
sequences identified on E. chaffeensis OMP-1F, OMP-1E and P28 are
homologous to the leader sequence of E. canis 28-kDa protein.
Promoter sequences for the p28 genes have not been determined
experimentally, but putative promoter regions were identified by
comparison with consensus sequences of the RBS, -10 and -35
promoter regions of E. coli and other ehrlichiae (Yu et al., 1997;
McClure, 1985). Such promoter sequences would allow each gene to
potentially be transcribed and translated, suggesting that these genes
may be differentially expressed in the host. Persistence of infection in
dogs may be related to differential expression of p28 genes resulting
in antigenic changes in vivo, thus allowing the organism to evade the
immune response.

The E. canis 28-kDa protein genes were found to exhibit
nucleic acid and amino acid sequence homology with the E.
chaffeensis omp-1 gene family and C. ruminantium map-1 gene.
Previous studies have identified a 30-kDa protein of E. canis that
reacts with convalescent phase antisera against E. chaffeensis, but was
believed to be antigenically distinct (Rikihisa et al., 1994). Findings
based on comparison of amino acid substitutions in four variable
regions of E. canis 28-kDa proteins support this possibility. Together
these findings also suggest that the amino acids responsible for the
antigenic differences between E. canis and E. chaffeensis P28 are
located in these variable regions and are readily accessible to the
immune system. It was reported that immunoreactive peptides were
located in the variable regions of the 28-kDa proteins of C.
ruminantium, E. chaffeensis and E. canis (Reddy et al., 1998). Analysis
of E. canis and E. chaffeensis P28 revealed that all of the variable
regions have predicted surface-exposed amino acids. A study in dogs
demonstrated lack of cross protection between E. canis and E.
chaffeensis (Dawson and Ewing, 1992). This observation may be
related to antigenic differences in the variable regions of P28 as well
as in other immunologically important antigens of these ehrlichial
species. Another study found that convalescent phase human antisera
from E. chaffeensis-infected patients recognized 29/28-kDa protein(s)
of E. chaffeensis and also reacted with homologous proteins of E. canis
(Chen et al., 1997). Homologous and crossreactive epitopes on the E.
canis 28-kDa protein and E. chaffeensis P28 appear to be recognized
by the immune system.

E. canis 28-kDa proteins may be important
immunoprotective antigens. Several reports have demonstrated that
the 30-kDa antigen of E. canis exhibits strong immunoreactivity
(Rikihisa et al., 1994; Rikihisa et al., 1992). Antibodies in
convalescent phase antisera from humans and dogs have consistently
reacted with proteins in this size range from E. chaffeensis and E.
canis, suggesting that they may be important immunoprotective
antigens (Rikihisa et al., 1994; Chen et al., 1994; Chen et al., 1997). In
addition, antibodies to 30, 24 and 21-kDa proteins developed early in
the immune response to E. canis (Rikihisa et al., 1994; Rikihisa et al.,
1992), suggesting that these proteins may be especially important in
the immune responses in the acute stage of disease. Recently, a family
of homologous genes encoding outer membrane proteins with
molecular masses of 28-kDa has been identified in E. chaffeensis, and
mice immunized with recombinant E. chaffeensis P28 appeared to
have developed immunity against homologous challenge (Ohashi et
al., 1998). The P28 of E. chaffeensis has been demonstrated to be
present in the outer membrane, and immunoelectron microscopy has
localized the P28 on the surface of the organism, suggesting
that it may serve as an adhesin (Ohashi et al., 1998). It is likely that
the 28-kDa proteins of E. canis identified in this study have the same
location and possibly serve a similar function.

Comparison of ECa28-1 from different strains of E. canis
revealed that the gene is apparently completely conserved. Studies
involving E. chaffeensis have demonstrated immunologic and
molecular evidence of diversity in the ECa28-1. Patients infected with
E. chaffeensis have variable immunoreactivity to the 29/28-kDa
proteins, suggesting that there is antigenic diversity (Chen et al.,
1997). Recently molecular evidence has been generated to support
antigenic diversity in the p28 gene from E. chaffeensis (Yu et al.,
1998). A comparison of five E. chaffeensis isolates revealed that two
isolates (Sapulpa and St. Vincent) were 100% identical, but three
others (Arkansas, Jax, 91HE17) were divergent by as much as 13.4%
at the amino acid level. The conservation of ECa28-1 suggests that E.
canis strains found in the United States may be genetically identical,
and thus E. canis 28-kDa protein is an attractive vaccine candidate for
canine ehrlichiosis in the United States. Further analysis of E. canis
isolates outside the United States may provide information regarding
the origin and evolution of E. canis. Conservation of the 28-kDa
protein makes it an important potential candidate for reliable
serodiagnosis of canine ehrlichiosis.

The role of multiple homologous genes is not known at
this point; however, persistence of E. canis infections in dogs could
conceivably be related to antigenic variation due to variable
expression of homologous 28-kDa protein genes, thus enabling E.
canis to evade immune surveillance. Variation of msp-3 genes in A.
marginale is partially responsible for variation in the MSP-3 protein,
resulting in persistent infections (Alleman et al., 1997). Studies to
examine 28-kDa protein gene expression by E. canis in acutely and
chronically infected dogs would provide insight into the role of the
28-kDa protein gene family in persistence of infection.

Any patents or publications mentioned in this specification
are indicative of the levels of those skilled in the art to which the
invention pertains. These patents and publications are herein
incorporated by reference to the same extent as if each individual
publication was individually indicated to be incorporated by
reference.

One skilled in the art will readily appreciate that the
present invention is well adapted to carry out the objects and obtain
the ends and advantages mentioned, as well as those inherent therein.
The present examples along with the methods, procedures,
treatments, molecules, and specific compounds described herein are
presently representative of preferred embodiments, are exemplary,
and are not intended as limitations on the scope of the invention.
Changes therein and other uses will occur to those skilled in the art
which are encompassed within the spirit of the invention as defined by
the scope of the claims.

1. DNA sequences encoding a 30-kilodalton protein of
Ehrlichia canis, wherein said protein is immunoreactive with
anti-Ehrlichia canis serum.

2. The DNA sequences of claim 1, wherein said protein
has an amino acid sequence selected from the group consisting of SEQ
ID No. 2, SEQ ID No. 4 and SEQ ID No. 6.

3. The DNA sequences of claim 2, wherein said protein
has an N-terminal signal sequence.

4. The DNA sequences of claim 3, wherein said protein
is post-translationally modified to a 28-kilodalton protein.

5. The DNA sequences of claim 1, wherein said DNA has
a sequence selected from the group consisting of SEQ ID No. 1, SEQ ID
No. 3 and SEQ ID No. 5.

6. The DNA sequences of claim 1, wherein said DNA is
contained in a single locus of Ehrlichia canis.

7. The DNA sequences of claim 6, wherein said locus is
a multigene locus of 5.592 kb in length.

8. The DNA sequences of claim 7, wherein said locus
encodes homologous 28-kilodalton proteins of Ehrlichia canis.

9. The DNA sequences of claim 8, wherein said
homologous 28-kilodalton proteins of Ehrlichia canis are selected
from the group consisting of ECa28SA1, ECa28SA2, ECa28SA3, ECa28-1
and ECa28-2.

10. A vector comprising the DNA sequences of claim 1.

11. The vector of claim 10, wherein said vector is an
expression vector capable of expressing a peptide or polypeptide
encoded by the sequence selected from the group consisting of SEQ ID
No. 1, SEQ ID No. 3 and SEQ ID No. 5 when said expression vector is
introduced into a cell.

12. A recombinant protein comprising the amino acid
sequence selected from the group consisting of SEQ ID No. 2, SEQ ID
No. 4 and SEQ ID No. 6.

13. The recombinant protein of claim 12, wherein said
amino acid sequence is encoded by a nucleic acid segment comprising
a sequence selected from the group consisting of SEQ ID No. 1, SEQ ID
No. 3 and SEQ ID No. 5.

14. A host cell comprising the nucleic acid segment
selected from the group consisting of SEQ ID No. 1, SEQ ID No. 3 and
SEQ ID No. 5.

15. A method of producing the recombinant protein of
claim 12, comprising the steps of:

obtaining a vector that comprises an expression region
comprising a sequence encoding the amino acid sequence selected
from the group consisting of SEQ ID No. 2, SEQ ID No. 4 and SEQ ID
No. 6 operatively linked to a promoter;

transfecting said vector into a cell; and

culturing said cell under conditions effective for
expression of said expression region.

16. An antibody immunoreactive with an amino acid
sequence selected from the group consisting of SEQ ID No. 2, SEQ ID
No. 4 and SEQ ID No. 6.

17. A method of inhibiting Ehrlichia canis infection in a
subject comprising the steps of:

identifying a subject suspected of being exposed to or
infected with Ehrlichia canis; and

administering a composition comprising a 28-kDa antigen
of Ehrlichia canis in an amount effective to inhibit an Ehrlichia canis
infection.

18. The method of claim 17, wherein said 28-kDa
antigen is a recombinant protein comprising an amino acid sequence
selected from the group consisting of SEQ ID No. 2, SEQ ID No. 4 and
SEQ ID No. 6.

19. The method of claim 18, wherein said recombinant
protein is encoded by a gene comprising a sequence selected from the
group consisting of SEQ ID No. 1, SEQ ID No. 3 and SEQ ID No. 5.

20. The method of claim 18, wherein said recombinant
protein is dispersed in a pharmaceutically acceptable carrier.



WO 00/32745 PCT/US99/28075



1 ATTT TATITATTACCMTaTATATAArATATrAMTITCTCTrACAAAAATCTCTAATG 60 

61 TTITATACCTAATAraTATATrCTGGCmr^^ 120 

121 rrArmrACTATmAGGrGTAATATGAATTGCAAAAAAATrCTTATAACA^CT^ 180 

H r. ^ P. 1 I I T T A r 

16 1 .iJiTATCAlTAATGTACTCTArrCCAAGCATATCrTrTTCTCATACTATACAAGATGGrM ?40 
r ' M y S T P r c; F SDriQDGN 

241 CATG(K?rcCTAACTrCTATATrAGTGGAAAGTATGTACCAAGTGTCTCACATTITGC^^ 300 
MOGNFYI SGKYVPSVSHFGS 

301 ClTCTCAGCTAAAGAAGAAAGCAAATCAACTGriTGGAGTrrTrGGAITAAAACATGATrG 360 
FSAKEESRSTVGVFGLKHDW 

36;. 'rr-ATGGAAGrCCAATACTrAAGAAlAAACACGGrGACTTTACnxnTCCAAAC^^ 420 
DGSPILKNKHADFTVPHYSF 

421 CAGATACGAGAACAATCCAirrCTAGGGITTGCAGGAGCrATCGGITACTCAATSGGTGG 480 
RYENNPFLOFAGAIGYSMOG 

481 CCCAAGAATAGAATTCGAAATATCITATGAAGCAITCGACGrAAAAAGTCCTAATATCAA 540 
PRIEFEISYEAF DVRSPNIN 

541 TTATCAAAATGACGCGCACAGGTACTGCGCrCTATCTCATCACACATCGGCAGCCATGQA 600 
YQMDAHRYCALSHHTSAAME 

601 AGCTGATAAATTTCJrCTrCITAAAAAACGAAGGGTrAATTGACATATCACTTGCAATAAA 660 
ADRFV FLRNEGLIDISLAIN 

661 TGCATGITATGATATAATAAATGACAAAGrACCTGnTCrCCITATATATGCGCAGGrAT 720 
ACYDIINDKVPVSPYICAGI 

721 TGGTACTGATTTGATITCTATGmGAAGCTACAAGTCCTAAAArrrCCTACCAAGGAAA 78U 
GTDLI SMFEATSPKISYQGK 

781 ACTGGGCATrAGrrACTCTATTAATCCGGAAACCTCTGmTCATCGGrGGGCATrrCCA 840 
LGISYSIMPETSVFIGGHFH 

841 CAGGATCATAGGrAATGAGTTTAGAGATATTCCTGCAATAGrACCTACTAACTCMCrAC 900 
RilONEFRDlPAIVPSNSTT 

901 AATAAGTGGACCACAATITGCAACAGTAACACTAAATGTmxrrCACTTTGGTTrAGAACT 960 
I SGPQFATVTLNVCHFGLEL 

9fil TGGAGGAAGAmAACnTCTlUlTTTrArrGrroCCA(a.TATrAAAAATGATCT 1020 
G G R F N F (SEQ ID NO: 2) 

1021 1 i 1 1 1 AWATTGCTACATACAAAAAAAGAAAAATAGTGGCAAAAGAATGTAGCAATAAGA 1080 
1081 GGGGGGGGGGGGACCAAATITATCrrcTATGCTrCCCAAGriTmiOT 1140 
1141 CTTAAACAACAGAAGGTAATATCCTCACGGAAAACITATCTrCAAATATriTATTTATrA 1200 
1201 CCAATCirATATAATA TATTAA ATITCnumCAAAAATXIACTAGrATm 1260 
1261 TATATATrCIGACTrG<nTTTCrn-CTGCACTrCTACTATrriTAATn-AT^ 1320 
1321 TAGGTTATAATAAWATGAATrGCMAAAQATTmCATAGCAAgrGCATnjATAT^ 1380 
1381 T GTC-li rCIiACCrAGCGTATCTrnTCrGAATCAATACATGAAOATAATATAAATGGTA L440 
1441 ACTTmCAmGTGCAAACTATATGCCAAGTGCCTrACACTTTGGCGTATmCAGTTA 1500 
1501 AAGAAGAGAAAAACACAACAACTGGAGrrrTCGGATTAAAACAAGATTGGOACGGAOCAA 1560 
1561 CACTAAAGGATGCAAGCWGCAGCCACACAVTAGACCCAAGTACAATG 1607 

(SEQ ID NO: 1)

FIG. 1

[Figure: gel or blot image, lanes 1 to 8, with DNA size markers at 9416-bp, 6557-bp, 4361-bp, 2322-bp, 2027-bp and 564-bp]

ECa28SA2

ATGAATTGTAAAAAAGTTTTCACAATAAGTGCATTGATATCATCCATATACTTCCTACCT 60
MNCKKVFTISALISSIYFLP

AATGTCTCATACTCTAACCCAGTATATGGTAACAGTATGTATGGTAATTTTTACATATCA 120
NVSYSNPVYGNSMYGNFYIS

GGAAAGTACATGCCAAGTGTTCCTCATTTTGGAATTTTTTCAGCTGAAGAAGAGAAAAAA 180
GKYMPSVPHFGIFSAEEEKK

AAGACAACTGTAGTATATGGCTTAAAAGAAAACTGGGCAGGAGATGCAATATCTAGTCAA 240
KTTVVYGLKENWAGDAISSQ

AGTCCAGATGATAATTTTACCATTCGAAATTACTCATTCAAGTATGCAAGCAACAAGTTT 300
SPDDNFTIRNYSFKYASNKF

TTAGGGTTTGCAGTAGCTATTGGTTACTCGATAGGCAGTCCAAGAATAGAAGTTGAGATG 360
LGFAVAIGYSIGSPRIEVEM

TCTTATGAAGCATTTGATGTGAAAAATCCAGGTGATAATTACAAAAACGGTGCTTACAGG 420
SYEAFDVKNPGDNYKNGAYR

TATTGTGCTTTATCTCATCAAGATGATGCGGATGATGACATGACTAGTGCAACTGACAAA 480
YCALSHQDDADDDMTSATDK

TTTGTATATTTAATTAATGAAGGATTACTTAACATATCATTTATGACAAACATATGTTAT 540
FVYLINEGLLNISFMTNICY

GAAACAGCAAGCAAAAATATACCTCTCTCTCCTTACATATGTGCAGGTATTGGTACTGAT 600
ETASKNIPLSPYICAGIGTD

TTAATTCACATGTTTGAAACTACACATCCTAAAATTTCTTATCAAGGAAAGCTAGGGTTG 660
LIHMFETTHPKISYQGKLGL

GCCTACTTCGTAAGTGCAGAGTCTTCGGTTTCTTTTGGTATATATTTTCATAAAATTATA 720
AYFVSAESSVSFGIYFHKII

AATAATAAGTTTAAAAATGTTCCAGCCATGGTACCTATTAACTCAGACGAGATAGTAGGA 780
NNKFKNVPAMVPINSDEIVG

CCACAGTTTGCAACAGTAACATTAAATGTATGCTACTTTGGATTAGAACTTGGATGTAGG 840
PQFATVTLNVCYFGLELGCR
(SEQ ID NO: 3)

TTCAACTTCTAATTTCGTGGTACACATATCACGAAGCTAAAATTGTTTTTTTATCTCTGC 900
F N F * (SEQ ID NO: 4)

TGTATACAAGAGAAAAAATAGTAGTGAAAATTACCTAACAATATGACAGTACAAGTTTAC 960 
C^AGCTTAT^CTCACAAAACTTCITGTGTCTTTTA^^ 1020 

FIG. 7-1

AGCTTCACTACTGTAGAGTGTGTTTATCAATGCTTTGTTTATTAATACTCTACATAATAT 1080
GTTAAATTTTTCTTACAAAACTCACTAGTAATTTATACTAGAATATATATTCTGACTTGT 1140
ECa28SA3 (SEQ ID NO: 31)

ATTTGCnTTATACTTCCACTATTGTTAATTTATTTTCACTATTTTAGGTGTAATATGAAT 1200
MN

TGCAAAAAAATTCTTATAACAACTGCATTAATGTCATTAATGTACTATGCTCCAAGCATA 1260
CKKILITTALMSLMYYAPSI

TCTTTTTCTGATACTATACAAGACGATAACACTGGTAGCTTCTACATCAGTGGAAAATAT 1320
SFSDTIQDDNTGSFYISGKY

GTACCAAGTGTTTCACATTTTGGTGTTTTCTCAGCTAAAGAAGAAAGAAACTCAACTGTT 1380
VPSVSHFGVFSAKEERNSTV

GGAGTTTTTGGATTAAAACATGATTGGAATGGAGGTACAATATCTAACTCTTCTCCAGAA 1440
GVFGLKHDWNGGTISNSSPE

AATATATTCACAGTTCAAAATTATTCGTTTAAATACGAAAACAACCCATTCTTAGGGTTT 1500
NIFTVQNYSFKYENNPFLGF

GCAGGAGCTATTGGTTATTCAATGGGTGGCCCAAGAATAGAACTTGAAGTTCTGTACGAG 1560
AGAIGYSMGGPRIELEVLYE

ACATTCGATGTGAAAAATCAGAACAATAATTATAAGAACGGCGCACACAGATACTGTGCT 1620
TFDVKNQNNNYKNGAHRYCA

TTATCTCATCATAGTTCAGCAACAAGCATGTCCTCCGCAAGTAACAAATTTGTTTTCTTA 1680
LSHHSSATSMSSASNKFVFL

AAAAATGAAGGGTTAATTGACTTATCATTTATGATAAATGCATGCTATGACATAATAATT 1740
KNEGLIDLSFMINACYDIII

GAAGGAATGCCTTTTTCACCTTATATTTGTGCAGGTGTTGGTACTGATGTTGTTTCCATG 1800
EGMPFSPYICAGVGTDVVSM

TTTGAAGCTATAAATCCTAAAATTTCTTACCAAGGAAAACTAGGATTAGGTTATAGTATA 1860
FEAINPKISYQGKLGLGYSI

AGTTCAGAAGCCTCTGTTTTTATCGGTGGACACTTTCACAGAGTCATAGGTAATGAATTT 1920
SSEASVFIGGHFHRVIGNEF

AGAGACATCCCTGCTATGGTTCCTAGTGGATCAAATCTTCCAGAAAACCAATTTGCAATA 1980
RDIPAMVPSGSNLPENQFAI

(SEQ ID NO: 5)

GTAACACTAAATGTGTGTCACTTTGGCATAGAACTTGGAGGAAGATTTAACTTCTGA 2037
VTLNVCHFGIELGGRFNF*

FIG. 7-2 (SEQ ID NO: 6)

SEQUENCE LISTING

<110> Walker, David H.
      McBride, Jere W.
      Yu, Xue-Jie

<120> Homologous 28-Kilodalton Immunodominant Protein
      Genes of Ehrlichia canis and Uses Thereof

<130> D6152PCT

<141> 1999-11-30

<150> 09/261,358

<151> 1999-03-03

<160> 33

<210> 1

<211> 1607

<212> DNA

<213> Ehrlichia canis

<220>

<223> nucleic acid sequence of ECa28-1

<400> 1



attttattta ttaccaatct tatataatat attaaatttc tcttacaaaa 50
atctctaatg ttttatacct aatatatata ttctggcttg tatctacttt 100
gcacttccac tattgttaat ttattttcac tattttaggt gtaatatgaa 150
ttgcaaaaaa attcttataa caactgcatt aatatcatta atgtactcta 200
ttccaagcat atctttttct gatactatac aagatggtaa catgggtggt 250
aacttctata ttagtggaaa gtatgtacca agcgtctcac attttggtag 300
cttctcagct aaagaagaaa gcaaatcaac tgctggagtt tttggattaa 350
aacatgattg ggatggaagt ccaatactta agaataaaca cgctgacttt 400
actgttccaa actattcgtt cagatacgag aacaatccat ttctagggtt 450
tgcaggagct atcggttact caatgggtgg cccaagaata gaattcgaaa 500
tatcttatga agcattcgac gtaaaaagtc ctaatatcaa ttatcaaaat 550
gacgcgcaca ggtactgcgc tctatctcat cacacatcgg cagccatgga 600
agctgataaa tttgtcttct taaaaaacga agggttaatt gacatatcac 650
ttgcaataaa tgcatgttat gatataataa atgacaaagt acctgtttct 700
ccttatatat gcgcaggtat tggtactgat ttgatttcta tgtttgaagc 750
tacaagtcct aaaatttcct accaaggaaa actgggcatt agttactcta 800
ttaatccgga aacctctgtt ttcatcggtg ggcatttcca caggatcata 850
ggtaatgagt ttagagatat tcctgcaata gtacctagta actcaactac 900
aataagtgga ccacaatttg caacagtaac actaaatgtg tgtcactttg 950



-J /TO a t~ 


L. y y ay y CI ay ci 




a a 1 1 1 1 a t tcr 


t tQCcacata 


1000 






tttttawtat 


tiactacatac 


aaaaaaagaa 


1050 


-3 a -} t~ ^ rr t~ nnc 
o.a.cxLo.cj uyy^-. 




acTcaa taaoa 


y y y y sj^y 3 


ggaccaaatt 


1100 


C3.L.CU UdU 


y k L. y — Ci 


V— V-* V— y V-^ 


c tatttatoa 


ct. uaaacaac 


1150 


-3 o/~T/~T f~ ^ t~ 

ay a.a.y y Uod u 




04. CI. CJL C4. i^CA \^ 


ttcaaatat t 


ttatttatta 


1200 






CICLCA \^ \«> Vm> k— ' v«- 


racaaaaatc 


accagtattt. 


1250 






gacttgcttt 


ccttctgcac 


ttctactatt 


1300 




U uy l^^a^ cat- 


taggttataa 


taawatgaat 


tiQcmaaacati 


1350 


ttttcatagc aagtgcattg atatcactaa tgtctttctt acctagcgta 1400
tctttttctg aatcaataca tgaagataat ataaatggta acttttacat 1450
tagtgcaaag tatatgccaa gtgcctcaca ctttggcgta ttttcagtta 1500
aagaagagaa aaacacaaca actggagttt tcggattaaa acaagattgg 1550
gacggagcaa cactaaagga tgcaagcwgc agccacacaw tagacccaag 1600
tacaatg 1607


<210> 2

<211> 278

<212> PRT

<213> Ehrlichia canis

<220>

<223> amino acid sequence of ECa28-1 protein

<400> 2

Met Asn Cys Lys Lys Ile Leu Ile Thr Thr Ala Leu Ile Ser Leu
                  5                  10                  15

Met Tyr Ser Ile Pro Ser Ile Ser Phe Ser Asp Thr Ile Gln Asp
                 20                  25                  30

Gly Asn Met Gly Gly Asn Phe Tyr Ile Ser Gly Lys Tyr Val Pro
                 35                  40                  45

Ser Val Ser His Phe Gly Ser Phe Ser Ala Lys Glu Glu Ser Lys
                 50                  55                  60

Ser Thr Val Gly Val Phe Gly Leu Lys His Asp Trp Asp Gly Ser
                 65                  70                  75

Pro Ile Leu Lys Asn Lys His Ala Asp Phe Thr Val Pro Asn Tyr
                 80                  85                  90

Ser Phe Arg Tyr Glu Asn Asn Pro Phe Leu Gly Phe Ala Gly Ala
                 95                 100                 105

Ile Gly Tyr Ser Met Gly Gly Pro Arg Ile Glu Phe Glu Ile Ser
                110                 115                 120

Tyr Glu Ala Phe Asp Val Lys Ser Pro Asn Ile Asn Tyr Gln Asn
                125                 130                 135

Asp Ala His Arg Tyr Cys Ala Leu Ser His His Thr Ser Ala Ala
                140                 145                 150

Met Glu Ala Asp Lys Phe Val Phe Leu Lys Asn Glu Gly Leu Ile
                155                 160                 165

Asp Ile Ser Leu Ala Ile Asn Ala Cys Tyr Asp Ile Ile Asn Asp
                170                 175                 180

Lys Val Pro Val Ser Pro Tyr Ile Cys Ala Gly Ile Gly Thr Asp
                185                 190                 195

Leu Ile Ser Met Phe Glu Ala Thr Ser Pro Lys Ile Ser Tyr Gln
                200                 205                 210

Gly Lys Leu Gly Ile Ser Tyr Ser Ile Asn Pro Glu Thr Ser Val
                215                 220                 225

Phe Ile Gly Gly His Phe His Arg Ile Ile Gly Asn Glu Phe Arg
                230                 235                 240

Asp Ile Pro Ala Ile Val Pro Ser Asn Ser Thr Thr Ile Ser Gly
                245                 250                 255

Pro Gln Phe Ala Thr Val Thr Leu Asn Val Cys His Phe Gly Leu
                260                 265                 270

Glu Leu Gly Gly Arg Phe Asn Phe
                275



<210> 3

<211> 849

<212> DNA

<213> Ehrlichia canis

<220>

<221> mat_peptide

<223> nucleic acid sequence

<400> 3



atgaattgta aaaaagcttt cacaataagt gcattgatat catccatata 50 

cttcctacct aatgtctcat actctaaccc agtatatggt aacagtatgt 100 

atggtaattt ttacatatca ggaaagtaca tgccaagtgt tcctcatttt 150 

ggaatttttt cagctgaaga agagaaaaaa aagacaactg tagtatatgg 200 

cttaaaagaa aactgggcag gagatgcaat atctagtcaa agtccagatg 250 

ataattttac cattcgaaat tactcattca agtatgcaag caacaagttt 300 




ttagggtttg cagcagctat cggctactcg ataggcagtc caagaacaga 350
agttgagatg tcctatgaag catctgatgt gaaaaatcca ggtgataact 400 
acaaaaacgg tgcttacagg tattgtgctt tatctcatca agatgacgcg 450 
gatgatgaca tgaccagtgc aaccgacaaa tttgtatatt taattaatga 500 
aggattactt aacatatcat ttatgacaaa catatgttat gaaacagcaa 550 
gcaaaaatat acctctctct ccttacatat gtgcaggtat tggtaccgat 600
ttaattcaca tgtttgaaac cacacatcct aaaatttctt atcaaggaaa 650 
gctagggttg gcctacttcg taagtgcaga gtcttcggtt tcttttggta 700 
tatattttca taaaattata aataataagt ttaaaaatgt tccagccatg 750 
gtacctatta actcagacga gatagtagga ccacagtttg caacagtaac 800 
attaaatgta tgctactttg gattagaact tggatgtagg ttcaacttc 849 



<210> 4 

<211> 283 

<212> PRT

<213> Ehrlichia canis 

<220> 

<223> amino acid sequence of ECa28SA2 protein 

<400> 4 

Met Asn Cys Lys Lys Val Phe Thr Ile Ser Ala Leu Ile Ser Ser
                  5                  10                  15

Ile Tyr Phe Leu Pro Asn Val Ser Tyr Ser Asn Pro Val Tyr Gly
                 20                  25                  30

Asn Ser Met Tyr Gly Asn Phe Tyr Ile Ser Gly Lys Tyr Met Pro
                 35                  40                  45

Ser Val Pro His Phe Gly Ile Phe Ser Ala Glu Glu Glu Lys Lys
                 50                  55                  60

Lys Thr Thr Val Val Tyr Gly Leu Lys Glu Asn Trp Ala Gly Asp
                 65                  70                  75

Ala Ile Ser Ser Gln Ser Pro Asp Asp Asn Phe Thr Ile Arg Asn
                 80                  85                  90

Tyr Ser Phe Lys Tyr Ala Ser Asn Lys Phe Leu Gly Phe Ala Val
                 95                 100                 105

Ala Ile Gly Tyr Ser Ile Gly Ser Pro Arg Ile Glu Val Glu Met
                110                 115                 120

Ser Tyr Glu Ala Phe Asp Val Lys Asn Pro Gly Asp Asn Tyr Lys
                125                 130                 135

Asn Gly Ala Tyr Arg Tyr Cys Ala Leu Ser His Gln Asp Asp Ala
                140                 145                 150

Asp Asp Asp Met Thr Ser Ala Thr Asp Lys Phe Val Tyr Leu Ile
                155                 160                 165

Asn Glu Gly Leu Leu Asn Ile Ser Phe Met Thr Asn Ile Cys Tyr
                170                 175                 180

Glu Thr Ala Ser Lys Asn Ile Pro Leu Ser Pro Tyr Ile Cys Ala
                185                 190                 195

Gly Ile Gly Thr Asp Leu Ile His Met Phe Glu Thr Thr His Pro
                200                 205                 210

Lys Ile Ser Tyr Gln Gly Lys Leu Gly Leu Ala Tyr Phe Val Ser
                215                 220                 225

Ala Glu Ser Ser Val Ser Phe Gly Ile Tyr Phe His Lys Ile Ile
                230                 235                 240

Asn Asn Lys Phe Lys Asn Val Pro Ala Met Val Pro Ile Asn Ser
                245                 250                 255

Asp Glu Ile Val Gly Pro Gln Phe Ala Thr Val Thr Leu Asn Val
                260                 265                 270

Cys Tyr Phe Gly Leu Glu Leu Gly Cys Arg Phe Asn Phe
                275                 280

<210> 5 

<211> 840 

<212> DNA 

<213> Ehrlichia canis 

<220> 

<221> mat_peptide 

<223> nucleic acid sequence of ECa28SA3 

<400> 5 



atgaattgca aaaaaattct tataacaact gcattaatgt cattaatgta 50
ctatgctcca agcatatctt tttctgatac tatacaagac gataacactg 100
gtagcttcta catcagtgga aaatatgtac caagtgtttc acattttggt 150
gttttctcag ctaaagaaga aagaaactca actgttggag tttttggatt 200
aaaacatgat tggaatggag gtacaatatc taactcttct ccagaaaaca 250
tattcacagt tcaaaattat tcgtttaaat acgaaaacaa cccattccta 300
gggtttgcag gagctattgg ttattcaatg ggtggcccaa gaatagaact 350
tgaagttctg tacgagacat tcgatgtgaa aaatcagaac aataattata 400

agaacggcgc acacagatac tgtgctttat ctcatcatag ttcagcaaca 450
agcatgtcct ccgcaagtaa caaatttgtt ttcttaaaaa atgaagggtt 500 
aattgactta tcatttacga taaatgcacg ctatgacaca ataattgaag 550 
gaatgccttt ttcaccttat atttgtgcag gtgttggtac tgatgttgtt 600 
tccatgtttg aagctataaa tcctaaaatt tcttaccaag gaaaactagg 650 
attaggttat agtataagtt cagaagcctc tgtttttatc ggtggacacc 700
ttcacagagt cataggtaat gaatttagag acatccctgc tatggttcct 750 
agtggatcaa atcttccaga aaaccaattt gcaatagtaa cactaaatgt 800 
gtgtcacttt ggcatagaac ttggagcaag atttaacttc 840 

<210> 6 

<211> 280 

<212> PRT 

<213> Ehrlichia canis 

<220> 

<223> amino acid sequence of ECa28SA3* protein 

<400> 6 

Met Asn Cys Lys Lys Ile Leu Ile Thr Thr Ala Leu Met Ser Leu
5 10 15

Met Tyr Tyr Ala Pro Ser Ile Ser Phe Ser Asp Thr Ile Gln Asp
20 25 30

Asp Asn Thr Gly Ser Phe Tyr Ile Ser Gly Lys Tyr Val Pro Ser
35 40 45

Val Ser His Phe Gly Val Phe Ser Ala Lys Glu Glu Arg Asn Ser
50 55 60

Thr Val Gly Val Phe Gly Leu Lys His Asp Trp Asn Gly Gly Thr
65 70 75

Ile Ser Asn Ser Ser Pro Glu Asn Ile Phe Thr Val Gln Asn Tyr
80 85 90

Ser Phe Lys Tyr Glu Asn Asn Pro Phe Leu Gly Phe Ala Gly Ala
95 100 105

Ile Gly Tyr Ser Met Gly Gly Pro Arg Ile Glu Leu Glu Val Leu
110 115 120

Tyr Glu Thr Phe Asp Val Lys Asn Gln Asn Asn Asn Tyr Lys Asn
125 130 135

Gly Ala His Arg Tyr Cys Ala Leu Ser His His Ser Ser Ala Thr
140 145 150

Ser Met Ser Ser Ala Ser Asn Lys Phe Val Phe Leu Lys Asn Glu
155 160 165

Gly Leu Ile Asp Leu Ser Phe Met Ile Asn Ala Cys Tyr Asp Ile
170 175 180

Ile Ile Glu Gly Met Pro Phe Ser Pro Tyr Ile Cys Ala Gly Val
185 190 195

Gly Thr Asp Val Val Ser Met Phe Glu Ala Ile Asn Pro Lys Ile
200 205 210

Ser Tyr Gln Gly Lys Leu Gly Leu Gly Tyr Ser Ile Ser Ser Glu
215 220 225

Ala Ser Val Phe Ile Gly Gly His Phe His Arg Val Ile Gly Asn
230 235 240

Glu Phe Arg Asp Ile Pro Ala Met Val Pro Ser Gly Ser Asn Leu
245 250 255

Pro Glu Asn Gln Phe Ala Ile Val Thr Leu Asn Val Cys His Phe
260 265 270

Gly Ile Glu Leu Gly Gly Arg Phe Asn Phe
275 280



<210> 7
<211> 133
<212> PRT
<213> Ehrlichia canis
<220>
<223> partial amino acid
<400> 7

Met Asn Cys Lys Lys Val Phe Thr Ile Ser Ala Leu Ile Ser Ser
5 10 15

Ile Tyr Phe Leu Pro Asn Val Ser Tyr Ser Asn Pro Val Tyr Gly
20 25 30

Asn Ser Met Tyr Gly Asn Phe Tyr Ile Ser Gly Lys Tyr Met Pro
35 40 45

Ser Val Pro His Phe Gly Ile Phe Ser Ala Glu Glu Glu Lys Lys
50 55 60

Lys Thr Thr Val Val Tyr Gly Leu Lys Glu Asn Trp Ala Gly Asp
65 70 75

Ala Ile Ser Ser Gln Ser Pro Asp Asp Asn Phe Thr Ile Arg Asn
80 85 90

Tyr Ser Phe Lys Tyr Ala Ser Asn Lys Phe Leu Gly Phe Ala Val
95 100 105

Ala Ile Gly Tyr Ser Ile Gly Ser Pro Arg Ile Glu Val Glu Met
110 115 120

Ser Tyr Glu Ala Phe Asp Val Lys Asn Gln Gly Asn Asn
125 130



<210> 8
<211> 287
<212> PRT
<213> Ehrlichia canis
<220>
<223> amino acid sequence
<400> 8

Met Lys Tyr Lys Lys Thr Phe Thr Val Thr Ala Leu Val Leu Leu
5 10 15

Thr Ser Phe Thr His Phe Ile Pro Phe Tyr Ser Pro Ala Arg Ala
20 25 30

Ser Thr Ile His Asn Phe Tyr Ile Ser Gly Lys Tyr Met Pro Thr
35 40 45

Ala Ser His Phe Gly Ile Phe Ser Ala Lys Glu Glu Gln Ser Phe
50 55 60

Thr Lys Val Leu Val Gly Leu Asp Gln Arg Leu Ser His Asn Ile
65 70 75

Ile Asn Asn Asn Asp Thr Ala Lys Ser Leu Lys Val Gln Asn Tyr
80 85 90

Ser Phe Lys Tyr Lys Asn Asn Pro Phe Leu Gly Phe Ala Gly Ala
95 100 105

Ile Gly Tyr Ser Ile Gly Asn Ser Arg Ile Glu Leu Glu Val Ser
110 115 120

His Glu Ile Phe Asp Thr Lys Asn Pro Gly Asn Asn Tyr Leu Asn
125 130 135

Asp Ser His Lys Tyr Cys Ala Leu Ser His Gly Ser His Ile Cys
140 145 150

Ser Asp Gly Asn Ser Gly Asp Trp Tyr Thr Ala Lys Thr Asp Lys
155 160 165

Phe Val Leu Leu Lys Asn Glu Gly Leu Leu Asp Val Ser Phe Met
170 175 180

Leu Asn Ala Cys Tyr Asp Ile Thr Thr Glu Lys Met Pro Phe Ser
185 190 195

Pro Tyr Ile Cys Ala Gly Ile Gly Thr Asp Leu Ile Ser Met Phe
200 205 210

Glu Thr Thr Gln Asn Lys Ile Ser Tyr Gln Gly Lys Leu Gly Leu
215 220 225

Asn Tyr Thr Ile Asn Ser Arg Val Ser Val Phe Ala Gly Gly His
230 235 240

Phe His Lys Val Ile Gly Asn Glu Phe Lys Gly Ile Pro Thr Leu
245 250 255

Leu Pro Asp Gly Ser Asn Ile Lys Val Gln Gln Ser Ala Thr Val
260 265 270

Thr Leu Asp Val Cys His Phe Gly Leu Glu Ile Gly Ser Arg Phe
275 280 285

Phe Phe



<210> 9
<211> 281
<212> PRT
<213> Ehrlichia chaffeensis
<220>
<223> amino acid sequence of E. chaffeensis P28
<400> 9

Met Asn Tyr Lys Lys Val Phe Ile Thr Ser Ala Leu Ile Ser Leu
5 10 15

Ile Ser Ser Leu Pro Gly Val Ser Phe Ser Asp Pro Ala Gly Ser
20 25 30

Gly Ile Asn Gly Asn Phe Tyr Ile Ser Gly Lys Tyr Met Pro Ser
35 40 45

Ala Ser His Phe Gly Val Phe Ser Ala Lys Glu Glu Arg Asn Thr
50 55 60

Thr Val Gly Val Phe Gly Leu Lys Gln Asn Trp Asp Gly Ser Ala
65 70 75

Ile Ser Asn Ser Ser Pro Asn Asp Val Phe Thr Val Ser Asn Tyr
80 85 90

Ser Phe Lys Tyr Glu Asn Asn Pro Phe Leu Gly Phe Ala Gly Ala
95 100 105

Ile Gly Tyr Ser Met Asp Gly Pro Arg Ile Glu Leu Glu Val Ser
110 115 120

Tyr Glu Thr Phe Asp Val Lys Asn Gln Gly Asn Asn Tyr Lys Asn
125 130 135

Glu Ala His Arg Tyr Cys Ala Leu Ser His Asn Ser Ala Ala Asp
140 145 150

Met Ser Ser Ala Ser Asn Asn Phe Val Phe Leu Lys Asn Glu Gly
155 160 165

Leu Leu Asp Ile Ser Phe Met Leu Asn Ala Cys Tyr Asp Val Val
170 175 180

Gly Glu Gly Ile Pro Phe Ser Pro Tyr Ile Cys Ala Gly Ile Gly
185 190 195

Thr Asp Leu Val Ser Met Phe Glu Ala Thr Asn Pro Lys Ile Ser
200 205 210

Tyr Gln Gly Lys Leu Gly Leu Ser Tyr Ser Ile Ser Pro Glu Ala
215 220 225

Ser Val Phe Ile Gly Gly His Phe His Lys Val Ile Gly Asn Glu
230 235 240

Phe Arg Asp Ile Pro Thr Ile Ile Pro Thr Gly Ser Thr Leu Ala
245 250 255

Gly Lys Gly Asn Tyr Pro Ala Ile Val Ile Leu Asp Val Cys His
260 265 270

Phe Gly Ile Glu Leu Gly Gly Arg Phe Ala Phe
275 280



<210> 10
<211> 283
<212> PRT
<213> Ehrlichia chaffeensis
<220>
<223> amino acid sequence of
<400> 10

Met Asn Tyr Lys Lys Ile Phe Val Ser Ser Ala Leu Ile Ser Leu
5 10 15

Met Ser Ile Leu Pro Tyr Gln Ser Phe Ala Asp Pro Val Thr Ser
20 25 30

Asn Asp Thr Gly Ile Asn Asp Ser Arg Glu Gly Phe Tyr Ile Ser
35 40 45

Val Lys Tyr Asn Pro Ser Ile Ser His Phe Arg Lys Phe Ser Ala
50 55 60

Glu Glu Ala Pro Ile Asn Gly Asn Thr Ser Ile Thr Lys Lys Val
65 70 75

Phe Gly Leu Lys Lys Asp Gly Asp Ile Ala Gln Ser Ala Asn Phe
80 85 90

Asn Arg Thr Asp Pro Ala Leu Glu Phe Gln Asn Asn Leu Ile Ser
95 100 105

Gly Phe Ser Gly Ser Ile Gly Tyr Ala Met Asp Gly Pro Arg Ile
110 115 120

Glu Leu Glu Ala Ala Tyr Gln Lys Phe Asp Ala Lys Asn Pro Asp
125 130 135

Asn Asn Asp Thr Asn Ser Gly Asp Tyr Tyr Lys Tyr Phe Gly Leu
140 145 150

Ser Arg Glu Asp Ala Ile Ala Asp Lys Lys Tyr Val Val Leu Lys
155 160 165

Asn Glu Gly Ile Thr Phe Met Ser Leu Met Val Asn Thr Cys Tyr
170 175 180

Asp Ile Thr Ala Glu Gly Val Pro Phe Ile Pro Tyr Ala Cys Ala
185 190 195

Gly Val Gly Ala Asp Leu Ile Asn Val Phe Lys Asp Phe Asn Leu
200 205 210

Lys Phe Ser Tyr Gln Gly Lys Ile Gly Ile Ser Tyr Pro Ile Thr
215 220 225

Pro Glu Val Ser Ala Phe Ile Gly Gly Tyr Tyr His Gly Val Ile
230 235 240

Gly Asn Asn Phe Asn Lys Ile Pro Val Ile Thr Pro Val Val Leu
245 250 255

Glu Gly Ala Pro Gln Thr Thr Ser Ala Leu Val Thr Ile Asp Thr
260 265 270

Gly Tyr Phe Gly Gly Glu Val Gly Val Arg Phe Thr Phe
275 280

<210> 11
<211> 280
<212> PRT
<213> Ehrlichia chaffeensis
<220>
<223> amino acid sequence of E. chaffeensis OMP-1C
<400> 11

Met Asn Cys Lys Lys Phe Phe Ile Thr Thr Ala Leu Ala Leu Pro
5 10 15

Met Ser Phe Leu Pro Gly Ile Leu Leu Ser Glu Pro Val Gln Asp
20 25 30

Asp Ser Val Ser Gly Asn Phe Tyr Ile Ser Gly Lys Tyr Met Pro
35 40 45

Ser Ala Ser His Phe Gly Val Phe Ser Ala Lys Glu Glu Lys Asn
50 55 60

Pro Thr Val Ala Leu Tyr Gly Leu Lys Gln Asp Trp Asn Gly Val
65 70 75

Ser Ala Ser Ser His Ala Asp Ala Asp Phe Asn Asn Lys Gly Tyr
80 85 90

Ser Phe Lys Tyr Glu Asn Asn Pro Phe Leu Gly Phe Ala Gly Ala
95 100 105

Ile Gly Tyr Ser Met Gly Gly Pro Arg Ile Glu Phe Glu Val Ser
110 115 120

Tyr Glu Thr Phe Asp Val Lys Asn Gln Gly Gly Asn Tyr Lys Asn
125 130 135

Asp Ala His Arg Tyr Cys Ala Leu Asp Arg Lys Ala Ser Ser Thr
140 145 150

Asn Ala Thr Ala Ser His Tyr Val Leu Leu Lys Asn Glu Gly Leu
155 160 165

Leu Asp Ile Ser Leu Met Leu Asn Ala Cys Tyr Asp Val Val Ser
170 175 180

Glu Gly Ile Pro Phe Ser Pro Tyr Ile Cys Ala Gly Val Gly Thr
185 190 195

Asp Leu Ile Ser Met Phe Glu Ala Ile Asn Pro Lys Ile Ser Tyr
200 205 210

Gln Gly Lys Leu Gly Leu Ser Tyr Ser Ile Asn Pro Glu Ala Ser
215 220 225

Val Phe Val Gly Gly His Phe His Lys Val Ala Gly Asn Glu Phe
230 235 240

Arg Asp Ile Ser Thr Leu Lys Ala Phe Ala Thr Pro Ser Ser Ala
245 250 255

Ala Thr Pro Asp Leu Ala Thr Val Thr Leu Ser Val Cys His Phe
260 265 270

Gly Val Glu Leu Gly Gly Arg Phe Asn Phe
275 280



<210> 12
<211> 286
<212> PRT
<213> Ehrlichia chaffeensis
<220>
<223> amino acid sequence of E. chaffeensis OMP-1D
<400> 12

Met Asn Cys Glu Lys Phe Phe Ile Thr Thr Ala Leu Thr Leu Leu
5 10 15

Met Ser Phe Leu Pro Gly Ile Ser Leu Ser Asp Pro Val Gln Asp
20 25 30

Asp Asn Ile Ser Gly Asn Phe Tyr Ile Ser Gly Lys Tyr Met Pro
35 40 45

Ser Ala Ser His Phe Gly Val Phe Ser Ala Lys Glu Glu Arg Asn
50 55 60

Thr Thr Val Gly Val Phe Gly Ile Glu Gln Asp Trp Asp Arg Cys
65 70 75

Val Ile Ser Arg Thr Thr Leu Ser Asp Ile Phe Thr Val Pro Asn
80 85 90

Tyr Ser Phe Lys Tyr Glu Asn Asn Leu Phe Ser Gly Phe Ala Gly
95 100 105

Ala Ile Gly Tyr Ser Met Asp Gly Pro Arg Ile Glu Leu Glu Val
110 115 120

Ser Tyr Glu Ala Phe Asp Val Lys Asn Gln Gly Asn Asn Tyr Lys
125 130 135

Asn Glu Ala His Arg Tyr Tyr Ala Leu Ser His Leu Leu Gly Thr
140 145 150

Glu Thr Gln Ile Asp Gly Ala Gly Ser Ala Ser Val Phe Leu Ile
155 160 165

Asn Glu Gly Leu Leu Asp Lys Ser Phe Met Leu Asn Ala Cys Tyr
170 175 180

Asp Val Ile Ser Glu Gly Ile Pro Phe Ser Pro Tyr Ile Cys Ala
185 190 195

Gly Ile Gly Ile Asp Leu Val Ser Met Phe Glu Ala Ile Asn Pro
200 205 210

Lys Ile Ser Tyr Gln Gly Lys Leu Gly Leu Ser Tyr Pro Ile Ser
215 220 225

Pro Glu Ala Ser Val Phe Ile Gly Gly His Phe His Lys Val Ile
230 235 240

Gly Asn Glu Phe Arg Asp Ile Pro Thr Met Ile Pro Ser Glu Ser
245 250 255

Ala Leu Ala Gly Lys Gly Asn Tyr Pro Ala Ile Val Thr Leu Asp
260 265 270

Val Phe Tyr Phe Gly Ile Glu Leu Gly Gly Arg Phe Asn Phe Gln
275 280 285

Leu



<210> 13 
<211> 278 
<212> PRT 

<213> Ehrlichia chaffeensis 

<220> 

<223> amino acid sequence of E. chaffeensis OMP-1E

<400> 13

Met Asn Cys Lys Lys Phe Phe Ile Thr Thr Ala Leu Val Ser Leu
5 10 15

Met Ser Phe Leu Pro Gly Ile Ser Phe Ser Asp Pro Val Gln Gly
20 25 30

Asp Asn Ile Ser Gly Asn Phe Tyr Val Ser Gly Lys Tyr Met Pro
35 40 45

Ser Ala Ser His Phe Gly Met Phe Ser Ala Lys Glu Glu Lys Asn
50 55 60

Pro Thr Val Ala Leu Tyr Gly Leu Lys Gln Asp Trp Glu Gly Ile
65 70 75

Ser Ser Ser Ser His Asn Asp Asn His Phe Asn Asn Lys Gly Tyr
80 85 90

Ser Phe Lys Tyr Glu Asn Asn Pro Phe Leu Gly Phe Ala Gly Ala
95 100 105

Ile Gly Tyr Ser Met Gly Gly Pro Arg Val Glu Phe Glu Val Ser
110 115 120

Tyr Glu Thr Phe Asp Val Lys Asn Gln Gly Asn Asn Tyr Lys Asn
125 130 135

Asp Ala His Arg Tyr Cys Ala Leu Gly Gln Gln Asp Asn Ser Gly
140 145 150

Ile Pro Lys Thr Ser Lys Tyr Val Leu Leu Lys Ser Glu Gly Leu
155 160 165

Leu Asp Ile Ser Phe Met Leu Asn Ala Cys Tyr Asp Ile Ile Asn
170 175 180

Glu Ser Ile Pro Leu Ser Pro Tyr Ile Cys Ala Gly Val Gly Thr
185 190 195

Asp Leu Ile Ser Met Phe Glu Ala Thr Asn Pro Lys Ile Ser Tyr
200 205 210

Gln Gly Lys Leu Gly Leu Ser Tyr Ser Ile Asn Pro Glu Ala Ser
215 220 225

Val Phe Ile Gly Gly His Phe His Lys Val Ile Gly Asn Glu Phe
230 235 240

Arg Asp Ile Pro Thr Leu Lys Ala Phe Val Thr Ser Ser Ala Thr
245 250 255

Pro Asp Leu Ala Ile Val Thr Leu Ser Val Cys His Phe Gly Ile
260 265 270

Glu Leu Gly Gly Arg Phe Asn Phe
275



<210> 14
<211> 280
<212> PRT
<213> Ehrlichia chaffeensis
<220>
<223> amino acid sequence of
<400> 14

Met Asn Cys Lys Lys Phe Phe Ile Thr Thr Thr Leu Val Ser Leu
5 10 15

Met Ser Phe Leu Pro Gly Ile Ser Phe Ser Asp Ala Val Gln Asn
20 25 30

Asp Asn Val Gly Gly Asn Phe Tyr Ile Ser Gly Lys Tyr Val Pro
35 40 45

Ser Val Ser His Phe Gly Val Phe Ser Ala Lys Gln Glu Arg Asn
50 55 60

Thr Thr Thr Gly Val Phe Gly Leu Lys Gln Asp Trp Asp Gly Ser
65 70 75

Thr Ile Ser Lys Asn Ser Pro Glu Asn Thr Phe Asn Val Pro Asn
80 85 90

Tyr Ser Phe Lys Tyr Glu Asn Asn Pro Phe Leu Gly Phe Ala Gly
95 100 105

Ala Val Gly Tyr Leu Met Asn Gly Pro Arg Ile Glu Leu Glu Met
110 115 120

Ser Tyr Glu Thr Phe Asp Val Lys Asn Gln Gly Asn Asn Tyr Lys
125 130 135

Asn Asp Ala His Lys Tyr Tyr Ala Leu Thr His Asn Ser Gly Gly
140 145 150

Lys Leu Ser Asn Ala Gly Asp Lys Phe Val Phe Leu Lys Asn Glu
155 160 165

Gly Leu Leu Asp Ile Ser Leu Met Leu Asn Ala Cys Tyr Asp Val
170 175 180

Ile Ser Glu Gly Ile Pro Phe Ser Pro Tyr Ile Cys Ala Gly Val
185 190 195

Gly Thr Asp Leu Ile Ser Met Phe Glu Ala Ile Asn Pro Lys Ile
200 205 210

Ser Tyr Gln Gly Lys Leu Gly Leu Ser Tyr Ser Ile Ser Pro Glu
215 220 225

Ala Ser Val Phe Val Gly Gly His Phe His Lys Val Ile Gly Asn
230 235 240

Glu Phe Arg Asp Ile Pro Ala Met Ile Pro Ser Thr Ser Thr Leu
245 250 255

Thr Gly Asn His Phe Thr Ile Val Thr Leu Ser Val Cys His Phe
260 265 270

Gly Val Glu Leu Gly Gly Arg Phe Asn Phe
275 280



<210> 15
<211> 284
<212> PRT
<213> Cowdria ruminantium
<220>
<223> amino acid sequence
<400> 15

Met Asn Cys Lys Lys Ile Phe Ile Thr Ser Thr Leu Ile Ser Leu
5 10 15

Val Ser Phe Leu Pro Gly Val Ser Phe Ser Asp Val Ile Gln Glu
20 25 30

Glu Asn Asn Pro Val Gly Ser Val Tyr Ile Ser Ala Lys Tyr Met
35 40 45

Pro Thr Ala Ser His Phe Gly Lys Met Ser Ile Lys Glu Asp Ser
50 55 60

Arg Asp Thr Lys Ala Val Phe Gly Leu Lys Lys Asp Trp Asp Gly
65 70 75

Val Lys Thr Pro Ser Gly Asn Thr Asn Ser Ile Phe Thr Glu Lys
80 85 90

Asp Tyr Ser Phe Lys Tyr Glu Asn Asn Pro Phe Leu Gly Phe Ala
95 100 105

Gly Ala Val Gly Tyr Ser Met Asn Gly Pro Arg Ile Glu Phe Glu
110 115 120

Val Ser Tyr Glu Thr Phe Asp Val Arg Asn Pro Gly Gly Asn Tyr
125 130 135

Lys Asn Asp Ala His Met Tyr Cys Ala Leu Asp Thr Ala Ser Ser
140 145 150

Ser Thr Ala Gly Ala Thr Thr Ser Val Met Val Lys Asn Glu Asn
155 160 165

Leu Thr Asp Ile Ser Leu Met Leu Asn Ala Cys Tyr Asp Ile Met
170 175 180

Leu Asp Gly Met Pro Val Ser Pro Tyr Val Cys Ala Gly Ile Gly
185 190 195

Thr Asp Leu Val Ser Val Ile Asn Ala Thr Asn Pro Lys Leu Ser
200 205 210

Tyr Gln Gly Lys Leu Gly Ile Ser Tyr Ser Ile Asn Pro Glu Ala
215 220 225

Ser Ile Phe Ile Gly Gly His Phe His Arg Val Ile Gly Asn Glu
230 235 240

Phe Lys Asp Ile Ala Thr Ser Lys Val Phe Thr Ser Ser Gly Asn
245 250 255

Ala Ser Ser Ala Val Ser Pro Gly Phe Ala Ser Ala Ile Leu Asp
260 265 270

Val Cys His Phe Gly Ile Glu Ile Gly Gly Arg Phe Val Phe
275 280

<210> 16 
<211> 20 
<212> DNA 

<213> artificial sequence 

<220> 

<221> primer_bind

<222> nucleotides 313-332 of C. ruminantium MAP-1,

also nucleotides 307-326 of E. chaffeensis P28
<223> forward primer 793 for PCR

<400> 16 

gcaggagctg ttggttactc 20 

<210> 17 
<211> 21 
<212> DNA 

<213> artificial sequence 

<220> 

<221> primer_bind 

<222> nucleotides 823-843 of C. ruminantium MAP-1, 

also nucleotides 814-834 of E. chaffeensis P28 
<223> reverse primer 1330 for PCR

<400> 17 

ccttcctcca agttctatgc c 21 

<210> 18 
<211> 24 
<212> DNA 



<213> artificial sequence
<220>

<221> primer_bind

<223> primer 46f, specific for ECa28SA2 gene

<400> 18 

atatacttcc tacctaatgt ctca 24 

<210> 19 

<211> 20 

<212> DNA 

<213> artificial sequence 
<220> 

<221> primer_bind 

<223> primer used for sequencing 28-kDa protein genes

in E. canis 

<400> 19 

agtgcagagt cttcggtttc 20 

<210> 20 

<211> 18 

<212> DNA 

<213> artificial sequence 
<220> 

<221> primer_bind 

<223> primer used for sequencing 28-kDa protein genes 

in E. canis 

<400> 20 

gttacttgcg gaggacat 18 

<210> 21 

<211> 24 

<212> DNA 

<213> artificial sequence 
<220> 

<221> primer_bind
<222> nucleotides 687-710 of ECa28-1
<223> primer 394 for PCR
<400> 21

gcatttccac aggatcatag gtaa 24

<210> 22
<211> 24
<212> DNA
<213> artificial sequence
<220>
<221> primer_bind
<222> nucleotides 710-687 of ECa28-1
<223> primer 394C for PCR
<400> 22

ttacctatga tcctgtggaa atgc 24



<210> 23

<211> 20
<212> DNA

<213> artificial sequence

<220>

<221> primer_bind

<223> primer 793C which anneals to a region within
ECa28-1, used to amplify the intergenic
region between gene ECa28SA3 and ECa28-1

<400> 23 

gagtaaccaa cagctcctgc 20 

<210> 24 

<211> 24 

<212> DNA 

<213> artificial sequence 
<220> 

<221> primer_bind
<222>

<223> primer EC280M-F complementary to noncoding
regions adjacent to the open reading frame
of ECa28-1
<400> 24 

tctactttgc acttccacta ttgt 24 

<210> 25 
<211> 24 
<212> DNA 

<213> artificial sequence 
<220> 

<221> primer_bind

<223> primer EC280M-R complementary to noncoding
regions adjacent to the open reading frame
of ECa28-1

<400> 25 

attcttttgc cactattttt cttt 24 



<210> 26 
<211> 25 
<212> DNA 

<213> artificial sequence 

<220> 

<221> primer_bind 

<223> primer ECaSA3-2 corresponding to regions within
ECa28SA3, used to amplify the intergenic region
NC3 between gene ECa28SA3 and ECa28-1

<400> 26 

ctaggattag gttatagtat aagtt 25 

<210> 27
<211> 23
<212> PRT
<213> Ehrlichia canis
<220>
<221> PEPTIDE
<223> a predicted N-terminal signal peptide of ECa28-1
and ECa28SA3
<400> 27

Met Asn Cys Lys Lys Ile Leu Ile Thr Thr Ala Leu Met Ser Leu
5 10 15

Met Tyr Tyr Ala Pro Ser Ile Ser
20

<210> 28 
<211> 25 
<212> PRT 

<213> Ehrlichia chaffeensis 

<220> 

<223> amino acid sequence of N-terminal signal peptide
of E. chaffeensis P28
<400> 28

Met Asn Tyr Lys Lys Ile Leu Ile Thr Ser Ala Leu Ile Ser Leu
5 10 15

Ile Ser Ser Leu Pro Gly Val Ser Phe Ser
20 25



<210> 29 

<211> 26 

<212> PRT 

<213> Ehrlichia canis 

<220> 

<223> amino acid sequence of putative cleavage site of
ECa28-1
<400> 29

Met Asn Cys Lys Lys Ile Leu Ile Thr Thr Ala Leu Ile Ser Leu
5 10 15

Met Tyr Ser Ile Pro Ser Ile Ser Ser Phe Ser
20 25

<210> 30
<211> 299
<212> DNA
<213> Ehrlichia canis
<220>
<223> nucleic acid sequence of intergenic noncoding
region 1 (28NC1)
<400> 30



taatacttct attgtacatg ttaaaaatag tactagcttg cttctgtggt 50 
ttataaacgc aagagagaaa tagttagtaa taaattagaa agttaaatat 100 
tagaaaagtc atatgttttt cattgtcatt gatactcaac taaaagtagt 150 
ataaatgcta cttattaata attttacgta gtatattaaa tttcccttac 200 
aaaagccact agtattttat actaaaagct atactttggc ttgtatttaa 250 
tttgtatttt tactactgtt aatttacttt cactgtttct ggtgtaaat 299 

<210> 31
<211> 345
<212> DNA
<213> Ehrlichia canis
<220>
<223> nucleic acid sequence of intergenic noncoding
region 2 (28NC2)
<400> 31

taatttcgtg gtacacatat cacgaagcta aaattgtttt tttatctctg 50
ctgtatacaa gagaaaaaat agtagtgaaa attacctaac aatatgacag 100
tacaagttta ccaagcttat tctcacaaaa cttcttgtgt cttttatctc 150
tttacaatga aatgtacact tagcttcact actgtagagt gtgtttatca 200
atgctttgtt tactaatact ctacataata tgttaaattt ttcttacaaa 250
actcactagt aatttatact agaatatata ttctgacttg tatttgcttt 300
atacttccac tattgttaat ttattttcac tattttaggt gtaat 345

<210> 32
<211> 345
<212> DNA
<213> Ehrlichia canis
<220>
<223> nucleic acid sequence of intergenic noncoding
region 3 (28NC3)
<400> 32


taattttatt gttgccacat actaaaaatg atctaaactt gtttttatta 50
ttgctacata caaaaaaaag aaaaatagtg gcaaaagaat gtagcaataa 100 
gagggggggg ggggactaaa tttaccttct attcttctaa caccctttac 150 
tatattcaaa tagcacaacc caatgcttcc aggaaaatat gctcctaata 200 
ttttatttat taccaatcct tatataatat attaaatttc ccttacaaaa 250 
atctctaatg ctttatactt aatatatata ttctggcttg tatctacttt 300 
gcacttccac tattgttaat ttattttcac tattttaggt gtaat 345 

<210> 33 

<211> 355 

<212> DNA 

<213> Ehrlichia canis 

<220> 

<223> nucleic acid sequence of intergenic noncoding
region 4 (28NC4)
<400> 33 

taattttatt gttgccacat attaaaaatg atctaaactt gtttttawta 50 
ttgctacata caaaaaaaga aaaatagtgg caaaagaatg tagcaataag 100 
aggggggggg gggaccaaat ttatcttcta tgcttcccaa gttttttcyc 150 
gctatttatg acttaaacaa cagaaggtaa tatcctcacg gaaaacttat 200 
cttcaaatat tttatttatt accaatctta tataatatat taaatttctc 250 
ttacaaaaat cactagtatt ttataccaaa atatatattc tgacttgctt 300 
ttcttctgca cttctactat ttttaattta tttgtcacta ttaggttata 350 
ataawr 355 






